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Figure 1A 

Restriction enzyme analysis of CPN100686 (RY 54 - SEQ ID NO. 1) 

Alwl 
Fnu4HI 
Taul 
Acil 
MspAlI 
Dpnl | 

Hpyl78III Cjel| III TspRI Cjel 
Acil Mnll |Sau3Al|| |||BtsI | Sspl 

I I I III III I 1 I 
ATGGACTTCCGCATATTGTCAGGAGGGGATCAGCGGCACTGCTAATGGACAATATTCTGC 
1 + + + + + + 

TACCTGAAGGCGTATAACAGTCCTCCCCTAGTCGCCGTGACGATTACCTGTTATAAGACG 



CjePI 
CviRI | 
I I 



60 



Taal CviJI 

BsaJI | Fokl BsmFI | Bbvl 

BstDSlj Sfcl Taal Fnu4HI | j Dral 

Cjel || Bed CviJI | Cjel CjePI | Tsel | j j Msel | 

III I II I II I I I I II 

AAACCGTGGATGGCGTATGGCTGTAGTGATTGACGGTTATATGGTCAGCAGCCCTATTTT 

TTTGGCACCTACCGCATACCGACATCACTAACTGCCAATATACCAGTCGTCGGGATAAAA 



120 



ApOl 
Tsp509I 
BsrI BsmAI I 

MslI Ddel | | 
Maell NlaIIl| TspRI j j 

I II I I I I I 

AAACGTCCCATTGAAAAATCATGCCAGTGTCTCAGGGAAATTTACCCACCGTGAAGTGAG 

121 + + + + + + 180 

TTTGCAGGGTAACTTTTTAGTACGGTCACAGAGTCCCTTTAAATGGGTGGCACTTCACTC 



BseMII Taal 



Hpyl88IX 
Ddel | 
I I 



Hpyl78III 
BseMII 
Mnll 
Dral 
HaelV 
Hin4I 
Msel | 
II 



BsmAI 
BsmBI 
Earl 
Ddel 
Sthl32I 
Bpml 
BsaJI 
Hpyl78III 
Aval | 
Mnll | | 

I I I 



CAAACTCGCCTCAGATTTAAAATCTGGAGCGATGTCTTTTGTTCCCGAGGTTCTCAGTGA 

181 + + + + + + 

GTTTGAGCGGAGTCTAAATTTTAGACCTCGCTACAGAAAACAAGGGCTCCAAGAGTCACT 



240 



u 
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% 



Dpnl 
Earl 
Sau3AI 
Hpyl88IX 
MboII 
Dpnl 



BseMII 
Hin4I 
Sau3AI 
MboII | 
TspRI | | 

I 



% 



a 



BplI 



Rsal 
BsrGI | 
TatI 



Ddel 
I 



Nlalll 
Nspl 
SphI 
Cac8I I 



AGAGACGATCTCTTCTGATCTTGGGAAAAAACAATGTACACAAGGCATTATCTCAGCATG 

241 + + + + + + 

TCTCTGCTAGAGAAGACTAGAACCCTTTTTTGTTACATGTGTTCCGTAATAGAGTCGTAC 



300 



Acelll 
Sthl32I 

BseMII Mnll Hin4I | 

Cvi JI BsrDI Hgal | BsaHI | | 

II II II I 

CTGTGGCTTGGCAATGCTTATTGTTTTGATGAGCGTATATTATAGATTTGGAGGCGTCAT 

301 + + + + + + 360 

GACACCGAACCGTTACGAATAACAAAACTACTCGCATATAATATCTAAACCTCCGCAGTA 



Acelll 
Bbvl 
Taal 
SfaNI 
Sf cl 



Alul 
CviJI 
MboII | 
Mwol | 
Hpyl78IIl| | 
II I 



Hinf I 
Tf il 
Hpyl8 8IX | 

I I 



Alul 
CviJI 
Fnu4HI | 
Tsel | j 
Cjel | | | 
I II I 



CGCTTCGGGAGCTGTTCTTCTGAATCTTTTGCTTATCTGGGCAGCTCTACAGTATTTGGA 

361 + + + + + + 420 

GCGAAGCCCTCGACAAGAAGACTTAGAAAACGAATAGACCCGTCGAGATGTCATAAACCT 



Hhal 
HphI 



Hinf I 
Hpyl78III 
Plel | 
Cjel | | 
Fokl | j j 

III I 



CviJI 
Haelll 
Bed 
Eael 
Gdill 
SfaNI | 



Bcefl SfaNI | | Mwol 

II I I I I I I I I I I 

TGCGCCACTCACCTTGTCAGGACTCGCTGGGATTGTTCTTGCTATGGGGATGGCCGTAGA 

421 + + + + + + 480 

ACGCGGTGAGTGGAACAGTCCTGAGCGACCCTAACAAGAACGATACCCCTACCGGCATCT 
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Hpyl88IX 
Mnl I | 
NspV Hinfl| j Apol 



BsmAI Msel 



Fokl| TaqI Tfil| |Tsp509I 

II I II I I II 

GCAAATGTTCTTGTATTCGAAAGAATCCGAGAGGAATTTTTATTGTCTCAAAGTCTTAA 

481 + + + + + + 

ACGTTTACAAGAACATAAGCTTTCTTAGGCTCTCCTTAAAAATAACAGAGTTTCAGAATT 



540 



CviJI CviJI 
BsaJI | NlaIV| Hinfl 

Sfcl Styl j Mwol | | Tfil Sfcl 

I I I I II I I 

AAAATCTGTAGAAAAAGGATATACCAAGGCTTTTGGAGCCATTTTTGATTCTAACTTGAC 

541 + + + + + + 600 

TTTTAGACATCTTTTTCCTATATGGTTCCGAAAACCTCGGTAAAAACTAAGATTGAACTG 

BbvCI 
BpulOI 

Ddel CviJI 

CviJI | BseMII Haelll BslI 

Hael | Mnll | EcoO109I | EcoNI | 

Taal Haelll j MboII | | Bfal Sau96I j Msel j 

I II I I I I I I I I 

TACAGTATTGGCCTCAGCACTTCTTTTCTTCCTAGATACAGGGCCTATTAAAGGGTTTGC 

601 + + + + + h 660 

ATGTCATAACCGGAGTCGTGAAGAAAAGAAGGATCTATGTCCCGGATAATTTCCCAAACG 



Apol 
Tsp509I 
MboII | 
Beef I | 

Apol Nlalll 
MboII Hpyl78III | 

Tsp509I Earl CviJI Real | | 

I I I I I I 

TTTGACATTGATTTTAGGAATTTTCTCTTCAATGTTTACGGCTCTTTTCATGACTAAATT 

661 + + + + + + 720 

AAACTGTAACTAAAATCCTTAAAAGAGAAGTTACAAATGCCGAGAAAAGTACTGATTTAA 



Ndel 

Fokl CviRI | 

Nlalll SimI | Taal | j XmnI 

I II III I 

TTTCTTCATGCTGTGGATGAATAAGACCCAACATACACAGTTGCATATGATGAATAAGTT 

721 + + + + + + 780 

AAAGAAGTACGACACCTACTTATTCTGGGTTGTATGTGTCAACGTATACTACTTATTCAA 
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Figure ID 



Hpyl78III 
Smll 
Mnll | 
SfaNI | 
Nlalll | j 
I I I 



Hpyl78III 
CviJI | 
Bce83I | | 
CviRI Fokl | | j 

I II I I 

CGTGGGGATAAAGCATGATTTCTTGAGAGGATGCAAAAAACTTTGGGCTGTTTCTGGAAG 

781 + + + + + + 840 

GCACCCCTATTTCGTACTAAAGAACTCTCCTACGTTTTTTGAAACCCGACAAAGACCTTC 



Apol 
EcoRI 
Tsp509I 
ScrFI 
CviJI | 
EcoRII | 

Sthl32I Aval NlaIV| | 

II III 
TGTTTTTCTTTTAGGTTGCGTTGCTCTCGGGTTTGGAGCCTGGAATTCCGTTTTGGGAAT 

841 + . + + + + + 900 

ACAAAAAGAAAATCCAACGCAACGAGAGCCCAAACCTCGGACCTTAAGGCAAAACCCTTA 



Dral 
Msel | 

Mnll | Msel Nlalll SfaNI 

III III 
GGATTTTAAAGGAGGGTATGCCTTTACCTTTAATCCAAAAGAGCATGGCATCAGCGATGT 

901 + + + + + + 960 

CCTAAAATTTCCTCCCATACGGAAATGGAAATTAGGTTTTCTCGTACCGTAGTCGCTACA 



CviRI 



Sf cl 



I 



MboII 
Alul I 
CviJI | 



Hpyl7 8III 
Bfal | 
Xbal | | 
BsmAI | j j 
I III 



TGCTCAAATGCGTGGCAAAGTTGTGCATAAACTACAGGAAGCTGGTCTTTCTTCTAGAGA 

961 + + + + + + 1020 

ACGAGTTTACGCACCGTTTCAACACGTATTTGATGTCCTTCGACCAGAAAGAAGATCTCT 



BsaBI 
Dpnl 
Sau3AI 
Alwl 
Hpyl8 8IX 



Tthlllll 
Dpnl 
BstYI | 
Sau3AI j 
Eco57I MboII | | 

I III 



Alul 
CviJI 
Hindlll | 
I I 



CTTCCGTATTCAAACATTTGGATCTTCAGAAAAGATCAAAATCTATTTTAGTGATAAAGC 

1021 + + + + + + 1080 

GAAGGCATAAGTTTGTAAACCTAGAAGTCTTTTCTAGTTTTAGATAAAATCACTATTTCG 
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J 

Cac8I 
RleAI 
Alul 
CviJI 
Nlalll 
Hpyl78III 
Real 
BplI 



Alul 
CviJI 
Msel | 



Ddel 
I 



Hin4I 
CviJI I 



Dpnl 
Sau3AI 
Msel 
Acelll | 
Tsp509I | j 
Mnll | | j 
I II 



II I II I I I I Mill III 

TTTAAGCTATACTAAGCAGATACGAGCCTCTCTCCTAAAATTAACGATCATGAGCTGGCG 

1081 + + + + + + 1140 

AAATTCGATATGATTCGTCTATGCTCGGAGAGAGGATTTTAATTGCTAGTACTCGACCGC 

Bfal 
CviJI 
Hael 
Haelll 
Hpyl88IX StuI 
I I 

TTATTGTGGGATTGTTGTCAGAAACAGGCCTAGATTTCTCTACGGAAACTCTAAACGAAA 

1141 + + + + + + 1200 

AATTACACCCTAACAACAGTCTTTGTCCGGATCTAAAGAGATGCCTTTGAGATTTGCTTT 



Bcgl 

Apol Fnu4HI Bbvl Sthl32I 

Tsp509I Tsel | TaqI | MboII |BcgI 

I II II III 

CGCAAAATTTTGGTCAAAGGTAAGCAGCAAACTATCGAAGAAAATGCGTTATCAGGCGAC 

1201 + + + + + + 1260 

GCGTTTTAAAACCAGTTTCCATTCGTCGTTTGATAGCTTCTTTTACGCAATAGTCCGCTG 



Alul 

Bed CviJI CviJI Hhal 

III ' I 

CATCGGGCTTTTAGGAGCTTTGGCAATCATCTTGCTCTATGTGAGTTTGCGCTTTGAATG 

1261 + + + + + + 1320 

GTAGCCCGAAAATCCTCGAAACCGTTAGTAGAACGAGATACACTCAAACGCGAAACTTAC 



CviJI 
I 
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Figure IF 

Nlalll 
Hpyl78III 
Tsp509I 
Msel | 

TspRI Hhal | | 
Beef I Mwol |MwoI | j | Real 

I I I I I II I , , 

GCAATATGCTTTCAGTGCCGTATGCGCTTTAATTCATGACCTTTTGGCTACCTGTGCAGT 

1321 + + + + + + 1380 

CGTTATACGAAAGTCACGGCATACGCGAAATTAAGTACTGGAAAACCGATGGAGACGTCA 

CviJI 

Apol Cac8I 
Bsgl Tsp509I MboII CviRI | 

I II II I I 

CTTGTTTATAGCACATTTCTTTTTGAAGAAAATTCAAATAGATTTGCAAGCCATTGGTGC 

1381 + + + + + + 1440 

GAACAAATATCGTGTAAAGAAAAACTTCTTTTAAGTTTATCTAAACGTTCGGTAACCACG 

Dpnl 

Bell | Dpnl 
Msel Taal Msel Sau3AI | Sau3AI |Hpyl78III 

II I I I I I I 

TTTAATGACTGTATTGGGGTATTCATTAAACAATACTTTGATCATTTTTGATCGTATTCG 

1441 + + + + + + 1500 

AAATTACTGACATAACCCCATAAGTAATTTGTTATGAAACTAGTAAAAACTAGCATAAGC 



CviRI 
Mwol | 
I I 



I 

| Mwol 



Sf aNI 
Nlalll 
Nspl 

Dpnl Nsil | 

Sau3AI | MboII CviRI | | | Msel 

II I I I I I I 

TGAAGATCGCCAAGCGAACCTGTTTACCCCTATGCATGTTTTAGTTAATGATGCCCTTCA 

1501 + + + + + + 1560 

ACTTCTAGCGGTTCGCTTGGACAAATGGGGATACGTACAAAATCAATTACTACGGGAAGT 



Acil 
Fnu4HI 

Taul MslI Alul 
Maell CviJI | Taal | CviJI Msel 

I II II I I 

AAAGACGTTCAGCCGCACGGTAATGACAACAGCTACAACTCTATCAGTTTTGTTAATGCT 

1561 + + + + + + 1620 

TTTCTGCAAGTCGGCGTGCCATTACTGTTGTCGATGTTGAGATAGTCAAAACAATTACGA 



NlalV 
CviJI 
Fnu4HI | 
Taul j 
BseRI Acil | | 
I II I 



Mnll 
Tsp509I 
Msel | 

II 



CjePI 



CviRI 



Hinf I 
MboII Tfil 
I I 



TTTGTTTATAGGCGGCTCCTCTGTCTTTAATTTTGCATTTATTATGACCATAGGGATTCT 
1621 + + + + + + 1680 



AAACAAATATCCGCCGAGGAGACAGAAATTAAAACGTAAATAATACTGGTATCCCTAAGA 
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Figure 1G 



BsmAI Avail 
Bfal CjePI BsmBI CviRI Mnll Sau96I 

I ill II 

TCTAGGAACTTTATCGTCTCTTTATATTGCACCACCTCTGTTGTTGTTTATGGTCCGTAA 

1681 + + + + + + 1740 

AGATCCTTGAAATAGCAGAGAAATATAACGTGGTGGAGACAACAA.CAAATACCAGGCATT 

Msel 

Taal | Afllll 

Rsal | | Msel Maell 

III I I 

AGAAAATCGCTCAAAATAAGTACCGTTAAACTTAATCTAACGTGTAGCAATATAAAAATC 

1741 + + + + + + 1800 

TCTTTTAGCGAGTTTTATTCATGGCAATTTGAATTAGATTGCACATCGTTATATTTTTAG 



BsmFI 



PshAI 
I 



NlalV 
CviJI 
Haelll 
Eco0109I | 
Sau96I | 
BsmFI | | 

I II 



Apol 
Tsp509I 
Msel | 
I I 



Hpyl8 8IX 
Apol | 
Tsp509I j 
I I 



TCCTTTGGGACTTTAGTCCCAAAGGCCCCTGTGGTATTAAATTTATGACAAATTCAGATA 

1801 + + + + + + 1860 

AGGAAACCCTGAAATCAGGGTTTCCGGGGACACCATAATTTAAATACTGTTTAAGTCTAT 



ATGC 

1861 1864 

TACG 
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igure 2A 

Restriction enzyme analysis of CPN100696 (RY 55 - SEQ ID NO 



2) 



Msel 

Apol Pad 
Tsp509I Vspl 
Dral Bed Msel |Tsp509I [ 

Msel|CviJI | Tsp509I | | Msel | j Bfal 

II I I I I I II I I 

TTATTTTAAAAGCCCATCTTTTTAGGTATGTAATTAAAATTTTTAATTAATGTTTTCCTA 

AATAAAATTTTCGGGTAGAAAAATCCATACATTAATTTTAAAAATTAATTACAAAAGGAT 

Fokl BscGI 
Maelll BspMI Bfal Taal | Mnll | 

I I III II 

GTGTAACCTGCTTCTTTAGGAACTACACTAGGAGAACGGTATGTCATCAAATCTACATCC 

61 + + + + + + 120 

CACATTGGACGAAGAAATCCTTGATGTGATCCTCTTGCCATACAGTAGTTTAGATGTAGG 



Bbvl 
Hinf I 
Ddel 
Hpyl78III 
Fnu4HI 
Alul 



Sthl32I 
BslI | 
II 



Bbvl 



CviJI 
MspAlI 
PvuII 
Tsel 
BseMII | 
Fnu4HI j 
Tsel | j 
II I 



Plel 
I 



Mnll 
I 



CGTAGGAGGAACAGGAACAGGAGCAGCTGCTCCTGAGTCTGTGCTAAACATAGTAGAGGA 
GCATCCTCCTTGTCCTTGTCCTCGTCGACGAGGACTCAGACACGATTTGTATCATCTCCT 



180 



Fnu4HI 
Tsel | 
Sthl32I | | 

I II 



Maelll 
Tsp45I 
Bbvl | 
SfaNI | 
HphI | | 

I I I 



MspAlI 
Acil | 



Xcml 
ScrFI 
BspGI 
ECORII 
Tthlllll 
BspGI 
Cjel | 
Maell I 



AccI Tsp509I 



181 



|BsrI 

III III 
AATAGCAGCATCGGGGAGTGTCACCGCTGGTCTACAAGCAATTACGTCCAGTCCAGGAAT 

+ + + + + + 

TTATCGTCGTAGCCCCTCACAGTGGCGACCAGATGTTCGTTAATGCAGGTCAGGTCCTTA 



240 
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Figure 2B 



Hinf I 
Tf il 



Bed 
HphI Cjel | 
I I I 



Apol 
Tsp509I 
Fokl | 
I I 



Hinf I 
Tf il 
BsaAI | 
Maell | | 
I I I 



GGTGAATCTACTCATAGGATGGGCAAAGACAAAATTTATTCAACCTATACGTGAATCAAA 

241 + + + + + + 300 

CCACTTAGATGAGTATCCTACCCGTTTCTGTTTTAAATAAGTTGGATATGCACTTAGTTT 



Tsp509I 
Cac8I 
Alul | 

Alul CviJI j 

CviJI Hpyl7 8III | j 

I III 



Apol 
EcoRI 
Tsp509I 



CjePI 

I I 

GCTCTTTCAATCCAGAGCTTGCCAAATTACCCTGCTCGTTTTAGGAATTCTTTTGGTTGT 

301 + + + + + + 360 

CGAGAAAGTTAGGTCTCGAACGGTTTAATGGGACGAGCAAAATCCTTAAGAAAACCAACA 



Cjel 
MboII | 

NlaIIl| | Cjel 
CjePI | j | BsrI Nsil | 

Mwol| Nsplj | CviJI | BslI CviRI || Bbvl 

II III II i I II I 

TGCTGGATTAGCATGTATGTTTATCTTCCATAGCCAGTTAGGGGCAAATGCATTTTGGTT 

361 + + + + + + 420 

ACGACCTAATCGTACATACAAATAGAAGGTATCGGTCAATCCCCGTTTACGTAAAACCAA 



Maelll 

Fnu4HI Maelll Bfal | 

Tsel| Msel | Spel| j MslI Hindlll 

II I I III I I 

GATTATTCCTGCTGCCATAGGATTGATTAAGTTACTAGTTACATCATTATGTTTTGATGA 

421 + + + + + + 480 

CTAATAAGGACGACGGTATCCTAACTAATTCAATGATCAATGTAGTAATACAAAACTACT 



Hpyl8 8IX 
Rsal 
BsrGI 
Tat I 

Alul | | | Dpnl 

CviJI | | j Nlalll BspMI BslI Aarl Sau3AI | 

I I I I I II I II 

AGCTTGTACATCTGAAAAACTCATGGTTTTCCAAAAATGGGCAGGTGTTTTAGAAGATCA 

481 + + + + + + 540 

TCGAACATGTAGACTTTTTGAGTACCAAAAGGTTTTTACCCGTCCACAAAATCTTCTAGT 
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Figure 2C 



Dpnl 
NlalV 
BamHI 
BstYI 
Sau3AI 
AceIIl| 

Bed | j j Nlalll 
Alwl III | CviJI 
MboII | | | | | Hael 
Taql || | | | | Haelll 
Alul Ml | I j j Alwl Mscl 
CviJI III III |MseI | Eael | 

I I I I III I I I II 
GCTCGATGATGGGATCCTTAATAACTCAAATAAGATTTTTGGCCATGTGAAAACAGAAGG 

541 + + + + + + 6 

CGAGCTACTACCCTAGGAATTATTGAGTTTATTCTAAAAACCGGTACACTTTTGTCTTCC 



Mnll 
CviJI | 
Bfal | | 
I I I 



Msel 
Rsal 
Seal 
Tat I | 
Bmrl | j 
Bsrl | j j 

III I 



SacII 
Acil 
MspAlI 
Thai 
Acil 
BsaJI 
BstDSI 
Fnu4HI 
Taul 
CviJI 
Haelll 
Bed | 
Eael j 
Gdill j 
I I 



Rsal 
TatI | 
Hphl| | 
II I 



AAATACCTCTAGGGCTACTACCCCAGTACTTAATGATGGCCGCGGAACTCCTGTACTTTC 

601 4- + + + + + 6 

TTTATGGAGATCCCGATGATGGGGTCATGAATTACTACCGGCGCCTTGAGGACATGAAAG 



Thai Tthlllll 

Cac8I | SfaNI | 

Alul | | Fokl SfaNI | j 

CviJI | | Maell | Bf al | | | 

III II II I I 

ACCTTTAGTAAGTAAAATAGCTCGCGTTTAGACGTTCATCTCACAAGCATCCTAGAACTT 

TGGAAATCATTCATTTTATCGAGCGCAAATCTGCAAGTAGAGTGTTCGTAGGATCTTGAA 
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Title: Chlamydia Antigens and 
Corresponding DNA Fragments and Uses 
Thereof 

Inventor(s): Andrew D. MURDIN et al. 
Appl. No.: 09/868,987 



Hpyl8 8IX 
Dpnl 
Sau3AI 



Rsal 
BsaAI | 
Sunl j 
Maell| | 
Fokl | | | 



III I III 



Tsp509I 
Taal I 



GGGATGCTACTTTCCACGTACGAGATCAGATGTAAAGAGCAACAGTAATTATTTTCTACA 

721 + + + + + + 780 

CCCTACGATGAAAGGTGCATGCTCTAGTCTACATTTCTCGTTGTCATTAATAAAAGATGT 



TspRI 

Taal | Nlalll 

I I I 
CTGTTGTAATAAAATCATGT 

781 + + 800 

GACAACATTATTTTAGTACA 
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Figure 3A 

Restriction enzyme analysis of CPN100709 (RY 57 - 

Dpnl 
Sau3AI 
Hpyl78III 
Alwl 
Hinf I 
Tf il 
Taal 
Bcgl 
Nsil 



J 

SEQ ID NO. 3) 



Dpnl 
Sau3AI | 
Cac8I I 



CviRI | 

Nlalll j 

Nspl | 

I I 



Taql 



Acil 
I 



Rsal 
TatI I 



TGCTGGCAGATCGTTTCCACATGCATACTGTGAATCTCGATCCCTATGCGGAAAATGTAC 
1 + + + + + + go 

ACGACCGTCTAGCAAAGGTGTACGTATGACACTTAGAGCTAGGGATACGCCTTTTACATG 

ApOl 
EcoRI 

Bcgl Msel Bfal Tsp509I 

II II 

TTGTAAACTTAAAAACCATAGCGACGACTTTTTCTAGTTTATGACAATACGAATTCTTGC 

6X + + + + + + 120 

AACATTTGAATTTTTGGTATCGCTGCTGAAAAAGATCAAATACTGTTATGCTTAAGAACG 



Alul 
CviJI 
Bfal 
CviJI 
Hael 
Haelll 
StuI 



HaelV Nlalll 

Hin4I Taqll 

NlalV | Hpyl78IIl| 

Eco57I Avail | j Real || 

Maelll | Sau96l| j BsmFI | || 

III II II I II II 

TGAAGGCCTAGCTTTCCGTTACGGAAGCAAGGGACCGAATATCATTCATGATGTTTCTTT 

121 + + + + + + 180 

ACTTCCGGATCGAAAGGCAATGCCTTCGTTCCCTGGCTTATAGTAAGTACTACAAAGAAA 



Mnll 

Hinfl Avail | 
Bed Tfil Sau961 j Bsll 

I I I I I 

CTCTGTCTATGATGGCGACTTTATAGGAATCATAGGACCAAACGGAGGGGGGAAAAGCAC 

181 + + + + + + 240 

GAGACAGATACTACCGCTGAAATATCCTTAGTATCCTGGTTTGCCTCCCCCCTTTTCGTG 
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Figure 3B 



Dpnl 
NlalV 
BamHI 
BstYI 
Sau3AI 
Hpyl88IX| 

Tsp509I Cac8I BslI | | 

Msel Msel | CviJI | Alwl | | | 

I II II I I II 

CTTAACGATGTTAATTTTGGGCTTGCTTACTCCTACATTCGGATCCTTGAAGACTTTCCC 

241 + + + + + + 300 

GAATTGCTACAATTAAAACCCGAACGAATGAGGATGTAAGCCTAGGAACTTCTGAAAGGG 



Bbsl 
Xmnl | 
Alwl | j 

I I I 



SacII 
Acil 
MspAlI 
Thai 
Acil 
BsaJI 
BstDSI 
BsmI 
Faul 
Sthl32I | 
MboII | | 



Dpnl 
Nlalll | 
Sau3AI I 



Alwl 



NlalV 

Cjel Sau3AI | j CjePI | Cjel | 

I III I I II 

TTCGCATTCCGCGGGGAAACAAACCCATTCCATGATCGGTTGGGTTCCCCAACATTTCTC 

301 + + + + + + 360 

AAGCGTAAGGCGCCCCTTTGTTTGGGTAAGGTACTAGCCAACCCAAGGGGTTGTAAAGAG 



BsmAI 

Dpnl Hpyl7 8III MboII 

Sau3AI | CjePI Ddel BseMII Ddel |MnlI BseMII | 

II I I I I I I I I 

TTATGATCCTTGTTTTCCTATCTCAGTAAAAGATGTTGTCCTCTCAGGAAGATTGTCTCA 

361 + + + + + + 420 

AATACTAGGAACAAAAGGATAGAGTCATTTTCTACAACAGGAGAGTCCTTCTAACAGAGT 



Dpnl 
Mmel 
Sau3AI 
Sfcl | | Dpnl 
ScrFI Alul| | | BstYI | 

EcoRII | Nlalll Cvi JI | j | Sau3AI | 

III II I I II 

ACTCTCCTGGCATGGAAAATATAAAAAGAAAGATTTTGAAGCTGTAGATCACGCTTTGGA 

421 + + + + + + 480 

TGAGAGGACCGTACCTTTTATATTTTTCTTTCTAAAACTTCGACATCTAGTGCGAAACCT 
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Figure 3C 



TspRI 
BtsI | 



Hpyl78III 
Ddel | 
Mnll | | 
Bed | | j 
I I I I 



BseMII 
Hin4I I 



Alwl Hpyl8 8IX 
I I 

TCTTGTTGGACTTTCTGACACCACCACCACTGCTTTCGCCCATCTCTCAGGAGGACAAAT 

481 + + + + + + 54 

AGAACAACCTGAAAGACTGTGGTGGTGGTGACGAAAGCGGGTAGAGAGTCCTCCTGTTTA 



Rsal 
Tat I | 



CviJI 
BpulOI | 
Ddel | 
CviJI | j 

I I I 



Hpyl78III 
Tsp509I 
Apol | 
Tsp509I j 
Mnll | Mselj 
I I II 



CCAGCGTGTACTTCTGGCAAGAGCCTTAGCCTCCTACCCTGAAATTTTAATTCTTGATGA 

541 + + + + + + 60 

GGTCGCACATGAAGACCGTTCTCGGAATCGGAGGATGGGACTTTAAAATTAAGAACTACT 



CviJI 



Tthlllll 
Hpyl78III 
Dpnl | 
Sau3AI | j 
Alwl | | j 

I III 



Alul 
CviJI 
BciVI I 



Apol 
Tsp509I Msel 

II II 
GCCGACGACAAACATTGATCCTGACAATCAACAAAGAATTTTAAGTATCCTAAAAAAGCT 

601 + + + + + + 66 

CGGCTGCTGTTTGTAACTAGGACTGTTAGTTGTTTCTTAAAATTCATAGGATTTTTTCGA 



BsiHKAI 
Bspl286I 

BseSI | Dpnl 

CviRI j Sau3AI 

ApaLI | j Hpyl7 8III 

BsaAI | | | HphI 

Maell | j | | MboII 

Rsal III | j Maelll | 

Sunl | | | j | | BstXI | | 

Taal j | j | | j MslI | j | | | | | Tsp509I Msel 

I I I I I II I I I I 
CAACCGTACGTGCACCATTCTTATGGTAACTCACGATCTTCACCATACGACGAATTACTT 

GTTGGCATGCACGTGGTAAGAATACCATTGAGTGCTAGAAGTGGTATGCTGCTTT^ATGAA 



72 



Bcgl 

CviRI TaqI Msel 

I I I 

TAATAAAGTTTTTTATATGAACAAAACTTTGCACTTCATTGGCAGACACTTCGACCTTAA 

721 + + + + + + 78 

ATTATTTCAAAAAATATACTTGTTTTGAAACGTGAAGTAACCGTCTGTGAAGCTGGAATT 
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gure 3D 



BseRI 

Tsp509I Apol | 

Bcgl| Tsp509I | 

Fokl | | Hpyl78III | |NlaIII Mnll 

III I I I I I 
CAGACCAATTTTGTTGTCATCCTATAAAAATCAGGAATTTTCATGCTCTCCTCACTAATC 
781 + + + + + + 8 

GTCTGGTTAAAACAACAGTAGGATATTTTTAGTCCTTAAAAGTACGAGAGGAGTGATTAG 



RleAI 
Fnu4HI | BsaXI 
Taul j CviJI | 

Hinfl Acil| | NlaIV| | 

Tf il Bfa-I | j |MwoI | | j 

I I II I I II I 

CGTGATTCATTTCCCCTTCTTATTTTACTTCCCACATTCCTAGCGGCATTAGGAGCCTCC 

841 + + + + + + 9 

GCACTAAGTAAAGGGGAAGAATAAAATGAAGGGTGTAAGGATCGCCGTAATCCTCGGAGG 



Fnu4HI 
Taul 
Acil 
Cac8I 
Mnll 
Alul | 
CviJI | 

Mwol | | | I NlalV Maell 

I I Ml I I 

GTAGCTGGCGGCGTTATGGGAACCTATATCGTTGTAAAACGTATTGTTTC 

901 + + + + + 950 

CATCGACCGCCGCAATACCCTTGGATATAGCAACATTTTGCATAACAAAG 
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Figure 4A 

Restriction enzyme analysis of CPN100710 (RY 58 - SEQ ID NO. 4) 



Dpnl 

Xmnl Sau3AI | 
Apol | Ddel | j 
Tsp509I |HphI | | |AciI 
I I I I I I I 



Ddel 



Tsp509I 
Msel I 



GAGAATTTTTTCCTAAGATCACCGCTTCTTAGGATATTCGTTCTTTATTAAAATTATGCC 
CTCTTAAAAAAGGATTCTAGTGGCGAAGAATCCTATAAGCAAGAAATAATTTTAATACGG 



60 



Dpnl 
Sau3AI | 
I I 



Nsil 
CviRI | 
Nlalll 



CCAATAGAATAATAGATCATCTTATCAAACTGCTTTTGTCATGCATAAAGTAATAGTTTT 
61 + + + + + + 120 

GGTTATCTTATTATCTAGTAGAATAGTTTGACGAAAACAGTACGTATTTCATTATCAAAA 



Msel 



CviJI 



CATTTTCCTTACCCTATATTCGTTAAAAAGTTATGGGAATGATGTAATAGATAAGCCCCA 

121 + + + + + + 180 

GTAAAAGGAATGGGATATAAGCAATTTTTCAATACCCTTACTACATTATCTATTCGGGGT 



Nlalll 



Bsal 
BsmAI 
Earl [ 
Tsp509I | | 

I II III 

TGTTCTTGTCAGTATCGCCCCCTATAAATTCCTAGTTGAACAAATTGCTGAAGAGACCTG 

181 + + + + + + 240 

ACAAGAACAGTCATAGCGGGGGATATTTAAGGATCAACTTGTTTAACGACTTCTCTGGAC 



Apol 
Tsp509I Bfal 



Dpnl 
Sau3AI | 

Alwl | | BbvCI 

Hinfl | | | BseRI BpulOI 

MboII Eco57I Maelll Tfil | | | MslI | Ddel 

I I I I I I I II I 

TTTTGTCTATGCGATAGTTACGAATCACTATGATCCCCATACCTATGAACTTCCTCCTCA 

241 + + + + + + 300 

AAAACAGATACGCTATCAATGCTTAGTGATACTAGGGGTATGGATACTTGAAGGAGGAGT 



Maelll 
BseMII | 

Mnll | j Bsal NlalV 

Mnll | || BsmAI Drdll| Mnll 

I I II I II I 

GCAAATCAAGGAGTTACGACAAGGAGACCTTTGGTTCCGTATAGGAGAGGCATTTGGAAA 

301 + + + + + + 360 

CGTTTAGTTCCTCAATGCTGTTCCTCTGGAAACCAAGGCATATCCTCTCCGTAAACCTTT 



Figure 4B 
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Dpnl 

CviRI CjePl| Hinfl 

Nlalll Sau3Al|| Tfil 

Nspl Taql | | | BsmAI | 

I I I I I I I 

AAACTTGTTAGAGAAACCTTACATGCAACAAGTCGATCTTTCCCAAAATGTCTCGCTGAT 

361 + + + + + + 

TTTGAACAATCTCTTTGGAATGTACGTTGTTCAGCTAGAAAGGGTTTTACAGAGCGACTA 



420 



CviJI 

CviJI Pf 111081 Taqll| 

CjePI | Cjel | BslI Msel | | 

ii ii i i ii 

TCAAGGAAAGCCTTGCTGTAATCAACATACCACGAACTACGACACCCACACTTGGTTAAG 

421 + + + + + + 480 

AGTTCCTTTCGGAACGACATTAGTTGTATGGTGCTTGATGCTGTGGGTGTGAACCAATTC 

Msel Maelll 
RleAI Cjel | BsmAI Cjel | Msel 

I II I II I 

CCCTAAAAACCTTAAAGTCCAAGTGGAGACTATCGTTACCACTTTAAGTAAAAAATATCC 

481 + + + + .+ + 540 

GGGATTTTTGGAATTTCAGGTTCACCTCTGATAGCAATGGTGAAATTCATTTTTTATAGG 



HaelV 

Hin4I 
Hinfl | 
Mnll| | 

Thai | j | Avail 
Bsbl III | Sau96I 
Cjel III j Alul | 

Plel III j BsrDI CviJI j Mnll 

I III I I III 

TCAACACGCGACTCTATATCAAAGCAATGGAGAGAAACTTCTGTTAGCTTTGGACCAACT 

541 + + + + + -f 600 

AGTTGTGCGCTGAGATATAGTTTCGTTACCTCTCTTTGAAGACAATCGAAACCTGGTTGA 



BsaJT 
BstDSI 

Apol NCOI 
Tsp509I Mnll Styl 

I I I 

CAATGAGGAAATTCTTACGATTACCTCCAAAGCGAAACAACGCCATATTTTAGTTTCCCA 

601 + + + + + + 660 

GTTACTCCTTTAAGAATGCTAATGGAGGTTTCGCTTTGTTGCGGTATAAAATCAAAGGGT 



Figure 4C 



Beef I 
CviJI 
Xcml 
Nlaiv| 
Nlalll | j 
I II 
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Tsp509I 



Ddel 



BseMII 
Sfcl | 



TGGAGCCTTTGGGTATTTTTGCCGTGATTACAATTTCTCTCAGCACACTATAGAGAAAAG 

661 + + + + + + 720 

ACCTCGGAAACCCATAAAAACGGCACTAATGTTAAAGAGAGTCGTGTGATATCTCTTTTC 



CviJI 
Nlalll | 
I I 



Hpyl78III 
Thai | 
Cac8I | Maelllj 
CviJI | j Tsp4 5l| 
III II 



MboII 
Rsal | 
Taal | | 
TatI | | 
I II 



CAGTCATGTTGAGCCTTCTCCTAAAGATGTGGCTCGCGTATTTCGTGACATTGAACAGTA 
GTCAGTACAACTCGGAAGAGGATTTCTACACCGAGCGCATAAAGCACTGTAACTTGTCAT 



780 



Hpyl78III 

Apol Hinfl | MboII Cac8I 

Tsp509I MboII Tfil Taql Hpyl78III Bbsl | Mwol | 

I I I I I I I II 

CAAAATTTCTTCTGTGATTCTTCTCGAATACTCTGGAAGACGAAGTAGTGCTATGCTGGC 

781 + + + + + + 840 

GTTTTAAAGAAGACACTAAGAAGAGCTTATGAGACCTTCTGCTTCATCACGATACGACCG 



Dpnl 
Sau3AI 
Hpyl78III 
Alwl 
Hinf I 
Tfil 
Taal 
Bcgl 
Nsil 



Dpnl 
Sau3AI | 
I I 



CviRI 
Nlalll 
Nspl 
I 



Taql 



Acil 



Bcgl 
Rsal | 
TatI | | 
I I I 



AGATCGTTTCCACATGCATACTGTGAATCTCGATCCCTATGCGGAAAATGTACTTGTAAA 

841 + + + + + + 900 

TCTAGCAAAGGTGTACGTATGACACTTAGAGCTAGGGATACGCCTTTTACATGAACATTT 



CviJI 

Apol Hael 
EcoRI Haelll 

Msel Bfal Tsp509I StuI 

I III 

CTTAAAAACCATAGCGACGACTTTTTCTAGTTTATGACAATACGAATTCTTGCTGAAGGC 

901 + + + + + + 960 

GAATTTTTGGTATCGCTGCTGAAAAAGATCAAATACTGTTATGCTTAAGAACGACTTCCG 




Title: Chlamydia Antigens and 
Corresponding DNA Fragments and Uses 
Thereof 

Inventor(s): Andrew D. MURDIN et al. 
Appl. No.: 09/868,987 



Figure 4D 



HaelV Nlalll 

Hin4I Taqll 

NlalV | Hpyl78IIl| 

Avail | j Real | j 

Sau96I | j BsmFI | j j 

II I II II 

CTAGCTTTCCGTTACGGAAGCAAGGGACCGAATATCATTCATGATGTTTCTTTCTCTGTC 

GATCGAAAGGCAATGCCTTCGTTCCCTGGCTTATAGTAAGTACTACAAAGAAAGAGACAG 



Alul 
CviJI ECOS7I 
Bfal |MaeIII | 
II II 



1020 



Hinf I 
Tf il 



Mnll 
Avail | 
Sau96I 



BccI Tfil Sau96I | BslI Msel 

I III | 

TATGATGGCGACTTTATAGGAATCATAGGACCAAACGGAGGGGGGAAAAGCACCTTAACG 

1021 + + + + + + 1080 

ATACTACCGCTGAAATATCCTTAGTATCCTGGTTTGCCTCCCCCCTTTTCGTGGAATTGC 



Dpnl 
NlalV 
BamHI 
BstYI 
Sau3AI 
Hpyl8 8IX| 

Tsp509I Cac8I BslI | | 

Msel | CviJI | Alwl | j | I Alwl 

II II I I II 

ATGTTAATTTTGGGCTTGCTTACTCCTACATTCGGATCCTTGAAGACTTTCCCTTCGCAT 

1081 + + + + + + 1140 

TACAATTAAAACCCGAACGAATGAGGATGTAAGCCTAGGAACTTCTGAAAGGGAAGCGTA 



BsmI 

Bbsl Faul 
Xmnl |Sthl32l| 
| |Mboii | | 
II III 



SacII 
Acil | 
MspAlI j 
Thai | 
Acil | j 
BsaJI | | 
BstDSI | j 
I II 

TCCGCGGGGAAACAAACCCATT 

1141 + + -- 1162 

AGGCGCCCCTTTGTTTGGGTAA 




Title: Chlamydia Antigens and ^ 
Corresponding DNA Fragments and Uses 
Thereof 

Inventor(s): Andrew D. MURDIN et al. 
Appl. No.: 09/868,987 

Figure 5A 

Restriction enzyme analysis of CPN100711 (RY 59 - SEQ ID NO. 5) 

BslI 
Dpnl 
Sau3AI 
ScrFI 
Apal 
Banll 
Bspl286I 
BsaJI 
EcoRII 

Bmgl 
BseSI 
CviJI 
Haelll 
NlalV 

Sau96I | | | | | | Hinf I 

Sau96I | I | | | I | Mmel 
Cjel I I I I I I I | Alwl Cjel Mnll Tfil 

I III 

ACAATCACTATGGGCCCAGGATCGGTTCTTTCCAACCATAGCAAAGAAGCAGGAGGAATC 
TGTTAGTGATACCCGGGTCCTAGCCAAGAAAGGTTGGTATCGTTTCTTCGTCCTCCTTAG 



60 



XmnI CviRI Taal 

I I I 

GCTATAAACAATGTCATCATTGATTTTAGTGAAATCGTTCCTACTAAAGATAATGCAACA 

61 + + + + + + 120 

CGATATTTGTTACAGTAGTAACTAAAATCACTTTAGCAAGGATGATTTCTATTACGTTGT 



Alul 

CviJI Hpyl78III 
BsaXI | Tsp509I TaqI | 

Mwol| | Msel |TaqII | j CviRI 

III I I I I I I 

GTAGCTCCACCCACTCTTAAATTAGTATCGAGAACTAATGCAGATAGTAAAGATAAGATT 

121 + + + + + + 180 

CATCGAGGTGGGTGAGAATTTAATCATAGCTCTTGATTACGTCTATCATTTCTATTCTAA 



igure 
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SB 



Dpnl 
BstYI 
Sau3AI 
Earl 
Hpyl78III 
Hinf I 
Ppil 
Maelll 
Taal 
Tsp45I 
AlwNI | | | Bf al 
MboII | | j Xbal | 
Plel | | | j Alwl | | 
II II I I II 



% 
% 



<2 



Apol 
Tsp509I 

I 



181 



GATATTACAGGAACTGTGACTCTTCTAGATCCTAATGGCAACTTATATCAAAATTCTTAT 



CTATAATGTCCTTGACACTGAGAAGATCTAGGATTACCGTTGAATATAGTTTTAAGAATA 



-+ 240 



MboII 
EcoRV 
HphI 
Bbsl | 

Thai | | | Maelll 
Acil | | | j Tsp509I CviRI Mwol | 

I I I II I III 

CTTGGTGAAGACCGCGATATCACTCTTTTCAATATAGACAATTCTGCAAGTGGGGCAGTT 

241 + + + + + + 300 

GAACCACTTCTGGCGCTATAGTGAGAAAAGTTATATCTGTTAAGACGTTCACCCCGTCAA 



HphI Apol ScrFI 

CviJI | Maelll Tsp509I Alul EcoRII | 

Mwol | |Tsp45I BslI | CviJI NlaIV| | 

I I I I II I III 

ACAGCCACGAATGTCACCCTTCAAGGGAATTTAGGAGCTAAAAAAGGATATTTAGGAACC 

301 + + + + + + 360 

TGTCGGTGCTTACAGTGGGAAGTTCCCTTAAATCCTCGATTTTTTCCTATAAATCCTTGG 



'Op 
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Figure 5C 



Aval 
BsaJI 
Alwl 
Apol 



Tsp509I 
Sthl32I 
Dpnl 
NlalV 
BamHI 
BstYI 
Sau3AI 
Alwl | 
Apol | | 
Tsp509l| | 



Avail 
Sau96I Cjel 



Tsp509I 
Mnll | 

II II I II II I 

TGGAATTTGGATCCAAATTCCTCGGGTTCAAAAATTATTCTAAAATGGACCTTTGACAAA 

361 + + + + + + 420 

ACCTTAAACCTAGGTTTAAGGAGCCCAAGTTTTTAATAAGATTTTACCTGGAAACTGTTT 



CviJI 
Haelll 
BspMI 
Sau96I 
Cac8I | 
Hhal | | 
Fokl | j | 
I I II 



Cjel 
Bfal | 
BsmAI I 



Cjel 
I 



TACCTGCGCTGGCCCTACATCCCTAGAGACAACCACTTCTACATCAACTCTATTTGGGGA 

421 + + + + + + 480 

ATGGACGCGACCGGGATGTAGGGATCTCTGTTGGTGAAGATGTAGTTGAGATAAACCCCT 



Ddel 
Dpnl 
BstYI 
Sau3AI 
BsaJI 

Maelll Styl 
BsiHKAI Tsp45I Drdll | 

Bspl286I Cjel | Taal | j 

I I I I II 

GCACAAAACTCTTTAGTGACTGTGAACCAAGGGATCTTAGGGAACATGTTGAACAATGCA 

481 + + + + + + 540 

CGTGTTTTGAGAAATCACTGACACTTGGTTCCCTAGAATCCCTTGTACAACTTGTTACGT 



Nlalll 
AflHI | 
BspLUllI j 
Alwl |NspI 

I I I 



CviRI 



Dpnl 
Cjel 
BstYI | 
Sau3AI | | Bsu36I 
Sfcl | j j Ddel 
MboII CviJI CviJI | III Alwl | 

I I II III II 

AGGTTTGAAGATCCTGCTTTCAACAACTTCTGGGCTTCGGCTATAGGATCTTTCCTTAGG 

541 + + + + + + 600 

TCCAAACTTCTAGGACGAAAGTTGTTGAAGACCCGAAGCCGATATCCTAGAAAGGAATCC 



Cjel 
Dpnl 
BstYI | 
Sau3AI | 
Alwl | j 

I I I 
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Figure 5D 



Hinf I 
Cjel | 
HphI I 
Hpyl88IX| 
Plel 
Apol | 



Tsp509I 
Hpyl78III | 
Taqi | 

I I 



Bbvl 
Nlalll | 
Mnll 



CviJI 



Fnu4HI 
Cjel 
MspAlI 
Tsel 
Sf aNI | 
Acil | | 
Ml 



AAAGAAGTATCTCGAAATTCTGACTCATTCACCTATCATGGCAGAGGCTATACCGCTGCT 

601 + + + + + + 660 

TTTCTTCATAGAGCTTTAAGACTGAGTAAGTGGATAGTACCGTCTCCGATATGGCGACGA 



Mwol 



Eco57I 
Bbvl 
Acelll 
Mnll 
Apol | 
Tsp509I | 
Fokl | | 

I I I 



Fnu4HI 
Alul | 

CviJI j 
Tsel j 



Maelll 
Tsp4 5I 



GTGGATGCCAAACCTCGCCAAGAATTTATTTTAGGAGCTGCCTTCAGTCAGGTTTTTGGT 

661 + + + + + + 

CACCTACGGTTTGGAGCGGTTCTTAAATAAAATCCTCGACGGAAGTCAGTCCAAAAACCA 



720 



Hpyl8 8IX 
HphI | 
Hinf I | j Plel 

III I 



Maelll 
Tsp45I 

BpulOI | 
Ddel | 

CviJI | j 

I I I 



BseMII 



CACGCCGAGTCTGAATATCACCTTGACAACTATAAGCATAAAGGCTCAGGTCACTCTACA 

721 + + + + + + 780 

GTGCGGCTCAGACTTATAGTGGAACTGTTGATATTCGTATTTCCGAGTCCAGTGAGATGT 



Cac8I 
MboII 
Tthlllll | 
Sf aNI | j 
I I I 



CviJI 
Haelll 
Taal Bsal | 
Hin4l| BsmAlj 
II II 



CAAGCATCTCTTTATGCTGGCAATATCTTCTATTTTCCTGCGATACGGTCTCGGCCTATT 

781 + + + + + + 840 

GTTCGTAGAGAAATACGACCGTTATAGAAGATAAAAGGACGCTATGCCAGAGCCGGATAA 



BsaJl BslI 
Styl PflMI CviRI Nlalll 

II II 
CTATTCCAAGGTGTGGCGACCTATGGTTATATGCAACATGACACCACAACCTACTATCCT 

841 + + + + + + 900 

GATAAGGTTCCACACCGCTGGATACCAATATACGTTGTACTGTGGTGTTGGATGATAGGA 



Figure 5E 
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BsrDI 
BsrI | 

Tthlllll | j Dpnl 
MboII Bmrl | j j Sau3AI | 

I II I I II 

TCTATTGAAGAAAAAAATATGGCAAACTGGGATAGCATTGCTTGGTTATTTGATCTGCGT 

901 + + + + + + 960 

AGATAACTTCTTTTTTTATACCGTTTGACCCTATCGTAACGAACCAATAAACTAGACGCA 



Alwl 
Msel 
Dpnl 
BstYI | 
Sau3AI j 
TspRI 



Mnll 
Sfcl | 
Mnll I 



| CviJI BseMII 

III I I I I I 

TTCAGTGTGGATCTTAAAGAACCTCAACCTCACTCTACAGCAAGGCTTACCTTCTATACA 

961 + + + + + + 1020 

AAGTCACACCTAGAATTTCTTGGAGTTGGAGTGAGATGTCGTTCCGAATGGAAGATATGT 



Apol 
EcoRI 
Tsp509I 
BstZ17I 
Ddel 



Alul | 
AlwNI j 
CviJI | 

II 



ACC-I 



Apol 
Tsp509I 
ScrFI | 
EcoRII | | 
I I I 



Dpnl 
Bglll 
BstYI 
Sau3AI 
Bf al 
HaelV 
Hin4I 
Dpnl 
Sau3AI 



Alwl 
Bf al | 
Alul | | 
CviJI | | 

I I I 



GAAGCTGAGTATACCAGAATTCGCCAGGAGAAATTCACAGAGCTAGACTATGATCCTAGA 

1021 + + + + + + 1080 

CTTCGACTCATATGGTCTTAAGCGGTCCTCTTTAAGTGTCTCGATCTGATACTAGGATCT 



Nlalll 
Nspl 

SphI BsrI 
Cac8I | Tsp509I Hinf I | AccI 

CviRI | j Ddel | Tfilj Sfcl | 

III II II I I 

TCTTTCTCTGCATGCTCTTATGGAAACTTAGCAATTCCTACTGGATTCTCTGTAGACGGA 

1081 + + + + + + 1140 

AGAAAGAGACGTACGAGAATACCTTTGAATCGTTAAGGATGACCTAAGAGACATCTGCCT 
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Figure 5F 



Fnu4HI 
Alul | 
CviJI j 
MspAlI j 

Alul Pvullj Hinfl 

CviJI Bbvl Tselj Rsal Tfil 

I I II I I 

GCATTAGCTTGGCGTGAGATTATTCTATATAATAAAGTATCAGCTGCGTACCTCCCTGTG 

1141 + + + + + + 12 

CGTAATCGAACCGCACTCTAATAAGATATATTATTTCATAGTCGACGCATGGAGGGACAC 



Hpyl78III 
Ddel | 

Mnll | j BseMII Maell 

I I I I I 

ATTCTCAGGAATAATCCAAAAGCGACCTATGAAGTTCTCTCTACAAAAGAAAAGGGCAAC 

1201 + + + + + + 12 

TAAGAGTCCTTATTAGGTTTTCGCTGGATACTTCAAGAGAGATGTTTTCTTTTCCCGTTG 



Bsgl 
HphI 
Apol 
Tsp509I 
Banll 
BsiHKAI 
Bspl286I 
Sad 
Alul 



CviJI 
Hin4I 
Acelll 
Bbvl 
CviRI 
BstAPI 
Mwol 
Mnll 
BssSI 
Alul [ 

Acll CviJI | 

Maell Fnu4HI | 

Hindi | Tsel| | 

II III 
GTAGTCAACGTTCTCCCTACAAGAAACGCAGCTCGTGCAGAGGTGAGCTCTCAAATTTAT 

1261 + + + + + + 13 

CATCAGTTGCAAGAGGGATGTTCTTTGCGTCGAGCACGTCTCCACTCGAGAGTTTAAATA 



C 
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Figure 5G 



BstZ17I 
AccI | 
Sf aNI | 

Maelll BsrI BsaAI | | 

BplI | BspGI | Maell|||| Beef I 

III Ml 
CTTGGAAGTTACTGGACACTCTACGGCACGTATACTATTGATGCTTCAATGAATACTTTA 

1321 + + + + + + 1380 

GAACCTTCAATGACCTGTGAGATGCCGTGCATATGATAACTACGAAGTTACTTATGAAAT 



CviJI 
Hael 

Haelll 
Mnll 
MscI 

Eael | 



Mspl 
BsaWI 
Dpnl 
NlalV 
BamHI 
BstYI 
Sau3AI 
BslI | 
Alwl | | 



Msel 
Tsp509I | 
BstZ17I | | 

CviRI Eael | Alwl | | |||AlwI Bfal AccI | | | 

I II I I I III I I II I I 

GTGCAAATGGCCAACGGAGGGATCCGGTTTGTATTCTAGGGTATACAATTAAAGATTTTA 

1381 + + + + + + 1440 

CACGTTTACCGGTTGCCTCCCTAGGCCAAACATAAGATCCCATATGTTAATTTCTAAAAT 



BciVI 
Tsp509I | 
Mnll | | 
I II 



NspV 
TaqI Taal | 

Hinf I | BsaJI | | 
Tfil (BstDSI | I 
II III 



Hindi 
Hpal 
Msel 
Thai 
Afllll 
Mlul 
Rsal 



CjePI 
I 



TGAAATTGAGGATACGGAGAGAGTGGGATTCGAACCCACGGTACGCGTTAACGCACACAC 

1441 + + + + + + 1500 

ACTTTAACTCCTATGCCTCTCTCACCCTAAGCTTGGGTGCCATGCGCAATTGCGTGTGTG 



CjePI 
Hpyl88IX 
CviJI 
Mwol 



Msel 
Af III 
Smll 
BsiHKAI 
Bspl286I 
Cac8I | 
Mwol | | 



I I 



GCTTTCCAAGCGTGCTCCTTAAGCCACTCGGACATCTCTCCATATTTATA 

1501 + + + + + 1550 

CGAAAGGTTCGCACGAGGAATTCGGTGAGCCTGTAGAGAGGTATAAATAT 
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Figure 6A 

Restriction enzyme analysis of CPN100877 (RY 61 



SEQ ID NO. 6) 



Cac8I 
Mwol | 

Maelll Cvi JI | 

Tsp45I Apol BsiHKAI | j 

Msel | Tsp509I Bspl286I | j 

II I I II 

AATTCTTTTTAAGTGACAAGAAATTCTTGTGCTCGGCTTGCTTTCTTATTCTTATTGACG 

TTAAGAAAAATTCACTGTTCTTTAAGAACACGAGCCGAACGAAAGAATAAGAATAACTGC 



Mae 1 1 
Tthlllll | 
I I 



Hpyl8 8IX 
Dpnl | 

Bell | | Hin4I 
Sau3AI j j Rsal Mwol Acil 

III I II 
TATTGCTTGATCAGATATTCATTTTGATTTAGGTACTAAAATGCGATTTTCGCTCTGCGG 
61 + + + + + + 120 

ATAACGAACTAGTCTATAAGTAAAACTAAATCCATGATTTTACGCTAAAAGCGAGACGCC 

Ddel 
Bbsl | 
MboII j BseMII 
Bfal Mnll BsrDI | | TaqI | 

II III II 

ATTTCCTCTAGTTTTTTCTTTTACATTGCTCTCAGTCTTCGACACTTCTTTGAGTGCTAC 

121 + + + + + + 180 

TAAAGGAGATCAAAAAAGAAAATGTAACGAGAGTCAGAAGCTGTGAAGAAACTCACGATG 

Acll 
Mae 1 1 

Nlalll Bsml | 

Pflll08I Msel MboII | Hpyl88IX CviRI | | 

I I II I I I I 

TACGATTTCTTTAACCCCAGAAGATAGTTTTCATGGAGATAGTCAGAATGCAGAACGTTC 

181 + + + + + + 240 

ATGCTAAAGAAATTGGGGTCTTCTATCAAAAGTACCTCTATCAGTCTTACGTCTTGCAAG 



Alul 
CviJI 

I 



Fokl 
CviJI | 
Sfcl | | 

I I I 



BsrI 



Bcgl 
HphI 
BsmAI | 

1 1 



TaqI 
Maell | 

I I 



TTATAATGTTCAAGCTGGGGATGTCTATAGCCTTACTGGTGATGTCTCAATATCTAACGT 

241 + + + + + + 300 

AATATTACAAGTTCGACCCCTACAGATATCGGAATGACCACTACAGAGTTATAGATTGCA 



u 
u 

C 
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Figure 6B 



BseMII 

Hpyl78III Maell| 
Cac8I Bsu36I |MaeIII | j 

Msel Bcgl| Maelll | | Mnll || 

CviRI | CviJI || Tsp45I Ddel |Tsp45I || 

II III I I I I II 

CGATAACTCTGCATTAAATAAAGCCTGCTTCAATGTGACCTCAGGAAGTGTGACGTTCGC 

301 + + + + + + 360 

GCTATTGAGACGTAATTTATTTCGGACGAAGTTACACTGGAGTCCTTCACACTGCAAGCG 



Hpyl78III 

BSU36I | BseMII 
Nlalll Msel Sspl Ddel | Mnll | CviJI 

I I I I I I I I 

AGGAAATCATCATGGGTTATATTTTAATAATATTTCCTCAGGAACTACAAAGGAAGGGGC 

361 + + + + + + 420 

TCCTTTAGTAGTACCCAATATAAAATTATTATAAAGGAGTCCTTGATGTTTCCTTCCCCG 



Smll 
Dpnl | 

Bce83I BstYI | | Tthlllll 

Rsal | Sau3AI j | Maell | 

TatI | | Alwl | j | Mnll | | Bcefl 

I I I I I I I III I 

TGTACTTTGTTGCCAAGATCCTCAAGCAACGGCACGTTTTTCTGGGTTCTCCACGCTCTC 

421 + + + + + + 480 

ACATGAAACAACGGTTCTAGGAGTTCGTTGCCGTGCAAAAAGACCCAAGAGGTGCGAGAG 



Msel 
Sthl32I 
Mspl 
Neil 
ScrFI 
Banll 



Bspl286I 
BsaJI | 
CviJI | | 
Hpyl88IX | | | 
I III 



Fokl 



BsmAI 



CviRI 



I I 



TTTTATTCAGAGCCCCGGAGATATTAAAGAACAGGGATGTCTCTATTCAAAAAATGCACT 

481 + + + + + + 

AAAATAAGTCTCGGGGCCTCTATAATTTCTTGTCCCTACAGAGATAAGTTTTTTACGTGA 



540 



541 



Tsp509I Ecil 
Msel | Acil| 

II II 
TATGCTCTTAAACAATTATGTAGTGCGTTTTGAACAAAACCAAAGTAAGACTAAAGGCGG 
+ + + + + + 

ATACGAGAATTTGTTAATACATCACGCAAAACTTGTTTTGGTTTCATTCTGATTTCCGCC 



600 
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Figure 5C 



Alul 
CviJI 



I 



Maelll Sfcl 



Hin4I 
Hinfl | 
Tf il j 
Pflll08I | | 

II III 
AGCTATTAGTGGGGCGAATGTTACTATAGTAGGCAACTACGATTCCGTCTCTTTCTATCA 

601 + + + + + + 

TCGATAATCACCCCGCTTACAATGATATCATCCGTTGATGCTAAGGCAGAGAAAGATAGT 



Hpyl88IX 
BsmAI | 
BsmBI j 
I I 



660 



661 



Mnll 

CviJI | BsmFI NlalV 

BsmI | | MboII Avail 

Fnu4Hl|| | Hin4I | Eco0109I 

CviRl||| | EC057I | | PspSII 

Tseljjj | Bbvl| j | Sau96I |SfcI CviRI 

I I I I I II I I III I 
GAATGCAGCCACTTTTGGAGGTGCTATCCATTCTTCAGGTCCCCTACAGATTGCAGTAAA 
+ + + + + + 

CTTACGTCGGTGAAAACCTCCACGATAGGTAAGAAGTCCAGGGGATGTCTAACGTCATTT 



720 



Rsal 

CjePI Hpyl78III Cjel| 

Cjel | Drdll | Tat I | | 

CviRI | | Mnll | | CviJI | j | 

III II I I III 

TCAGGCAGAGATAAGATTTGCACAAAATACTGCCAAGAATGGTTCTGGAGGGGCTTTGTA 

AGTCCGTCTCTATTCTAAACGTGTTTTATGACGGTTCTTACCAAGACCTCCCCGAAACAT 



780 



Hpyl88IX 
Dpnl 



Bed 
Bpml | 
Hpyl88IX | | 
CjePI | | | 
I I II 



Bell 
Sau3AI 
BsaBI | 
HphI | | 
I I I 



BsmI 



Mnll 
Hpyl78III | 
Taql | | 
I II 



CTCCGATGGTGATATTGATATTGATCAGAATGCTTATGTTCTATTTCGAGAAAATGAGGC 

781 + + + + + + 

GAGGCTACCACTATAACTATAACTAGTCTTACGAATACAAGATAAAGCTCTTTTACTCCG 



840 



Eco57I 
Bbsl | 
MboII | 
CviJI 
BsaXI | 

Sfcl Mnll Hin4l| 

I I II 

ATTGACTACTGCTATAGGTAAGGGAGGGGCTGTCTGTTGTCTTCCCACTTCAGGAAGTAG 

841 + + + + + + 900 

TAACTGATGACGATATCCATTCCCTCCCCGACAGACAACAGAAGGGTGAAGTCCTTCATC 



Hpyl78III 
BslI | 
Bpml | | 
I I I 



TatI 



I 
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gure 6D 



BsrI Taqll 
Rsal | Maelll XmnI | 

Seal | Tsp45I Hpyl88IX Taal Cjel | j 

II I I I I II 

TACTCCAGTTCCTATTGTGACTTTCTCTGACAATAAACAGTTAGTCTTTGAAAGAAACCA 

ATGAGGTCAAGGATAACACTGAAAGAGACTGTTATTTGTCAATCAGAAACTTTCTTTGGT 



960 



961 



CviJT Eco57I 
NlaIV| Bfal | 

Ecil | | Cjel | | MboII 

Acil | | | Mwol | j | Ddel | 

II II I I I I II 

TTCCATAATGGGTGGCGGAGCCATTTATGCTAGGAAACTTAGCATCTCTTCAGGAGGTCC 

+ + + + + + 1020 

AAGGTATTACCCACCGCCTCGGTAAATACGATCCTTTGAATCGTAGAGAAGTCCTCCAGG 



Avail 
ECO0109I 
PspSII 
Sau96I 
Sse8647I 
Earl 
Hpyl78III 
Sf aNI | 
Mnll | | 
I II 



Apol 
Tsp509I 

CviRI | Apol Alul 

Ndel | | Tsp509I CviJI 

III I I 

TACTCTATTTATCAATAATATATCATATGCAAATTCGCAAAATTTAGGTGGAGCTATTGC 

1021 + + + + + + 1080 

ATGAGATAAATAGTTATTATATAGTATACGTTTAAGCGTTTTAAATCCACCTCGATAACG 

Dpnl 
Sau3AI | 

Hin4I | | BsaJT 
Mnll BsrI | | | Bpml Tsp509I Styl 

I I I I I I I I 

CATTGATACTGGAGGGGAGATCAGTTTATCAGCAGAGAAAGGAACAATTACATTCCAAGG 

1081 + + + + + + 1140 

GTAACTATGACCTCCCCTCTAGTCAAATAGTCGTCTCTTTCCTTGTTAATGTAAGGTTCC 



Hpyl78III 

Mspl Alul Taal SfaNI Apol | 

BsaWI | CviJI Fokl| Bed | Tsp509I j 

II I II II II 

AAACCGGACGAGCTTACCGTTTTTGAATGGCATCCATCTTTTACAAAATGCTAAATTCCT 

1141 + + + + + + 1200 

TTTGGCCTGCTCGAATGGCAAAAACTTACCGTAGGTAGAAAATGTTTTACGATTTAAGGA 
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Figure 6E 



Dpnl 
Sau3AI 
Alwl 
Apol | 
Tsp509I | 

Tsp509I BciVI Sfcl | | | | Hpyl88IX 

I I III 

GAAATTACAGGCGAGAAATGGATACTCTATAGAATTTTATGATCCTATTACTTCTGAAGC 

1201 + + + + + + 1260 

CTTTAATGTCCGCTCTTTACCTATGAGATATCTTAAAATACTAGGATAATGAAGACTTCG 



Eco57I 
Muni | 

Tsp5 09l| Dpnl 

AccI | j BstYI | NlalV 

SimI | || Sau3AI j Rsal Avail | 

Bed || || Alwl | | TatI | Sau96l| 

I I I II III II II 

AGATGGGTCTACCCAATTGAATATCAACGGAGATCCTAAAAATAAAGAGTACACAGGGAC 

1261 + + + + + + 1320 

TCTACCCAGATGGGTTAACTTATAGTTGCCTCTAGGATTTTTATTTCTCATGTGTCCCTG 



Bfal 
Avrll 
BsaJI 
Styl 
Dpnl 
Sau3AI | 
Bpml | j 
Plel || j || Dral 
Alwl | | | | j | HaeIV| 
Hpyl78III Bfal || || | || Hin4l| 

BsmFI | Hinfl | || || | || Msel | 

II I I II II I 

CATACTCTTTTCTGGAGAAAAGAGTCTAGCAAACGATCCTAGGGATTTTAAATCTACAAT 

GTATGAGAAAAGACCTCTTTTCTCAGATCGTTTGCTAGGATCCCTAAAATTTAGATGTTA 



1380 



PstI 
CviRI 
Sfcl 
BciVI 



BseMII 
Hindi 
Mnll 
Maell | 
Hpyl88IX | j 
Ddel | | | 
II II 



1381 



Msel 

Ddel Mnll | 
I I I 

CCCTCAGAACGTCAACCTGTCTGCAGGATACTTAGTTATTAAAGAGGGGGCCGAAGTCAC 
+ + + + + + 

GGGAGTCTTGCAGTTGGACAGACGTCCTATGAATCAATAATTTCTCCCCCGGCTTCAGTG 



Maelll 
Tsp4 5I 
CviJI 
Haelll 
NlaIV| 
Sau96I | j 
III 



1440 
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Figure 6F 



Apol 
Tsp509I 
Taal Bpml | 

I I I 



Dpnl 
Sau3AI 
BsmAI 
ScrFI 
EcoRII | 
BsaXI | | 



Drdll 
NlaIV| 

II 



% 



1 



% 



3) 

o 



Alwl 

I I I I I I 
AGTTTCAAAATTCACGCAGTCTCCAGGATCGCATTTAGTTTTAGATTTAGGAACCAAACT 

1441 + + + + + + 1500 

TCAAAGTTTTAAGTGCGTCAGAGGTCCTAGCGTAAATCAAAATCTAAATCCTTGGTTTGA 



Bbsl 

Ddel BsrDI |MboII 
CviJI | Mnll | j Bed | 
II I I I II 



Hpyl7 8III 
CviJI | 
Hael | 
Haelll Nrul 
StuI Thai 



Alul 
CviJI 
Msel 
Af III 



Mnll 
I 



Smll 
Alul | 
CviJI | 
Fokl | | 

I I I 



GATAGCCTCTAAGGAAGACATTGCCATCACAGGCCTCGCGATAGATATAGATAGCTTAAG 

1501 + + + + + + 1560 

CTATCGGAGATTCCTTCTGTAACGGTAGTGTCCGGAGCGCTATCTATATCTATCGAATTC 

Alul 
AlwNI 
CviJI 
MspAlI 
PvuII 
Mnll | 
Fnu4HI | j 
Tsel| | | 
I I I I 

CTCATCCTCAACAGCAGCTGTTATTAAAGCAAACACCGCAAATAAACAGATATCCGTGAC 

1561 + + + + + + 1620 

GAGTAGGAGTTGTCGTCGACAATAATTTCGTTTGTGGCGTTTATTTGTCTATAGGCACTG 



Bbvl 
Msel I 



I I 



Acil 
Mwol | 
II 



Plel 
Maelll 
Tsp4 5I 
ECORV | 



Tthlllll 



I I 
I I 



Apol 
Tsp509I 
MboII 
Hpyl8 8IX 

Ddel 
Dpnl 
Bglll 
BstYI 

Sfcl Sau3AI 
Hinfl | Bsrl BsrDI | 

II III 
GGACTCTATAGAACTTATCTCGCCTACTGGCAATGCCTATGAAGATCTCAGAATGAGAAA 

1621 + + + + + + 1680 

CCTGAGATATCTTGAATAGAGCGGATGACCGTTACGGATACTTCTAGAGTCTTACTCTTT 
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Figure 6G 



Neil 
ScrFI 
BsaJI 
Mspl 
BslI 
CviJI 
NlalV 
ScrFI 
CviJI 
EcoRII 

Hin4I Sthl32I 
BseMII Maell | Mnll | 

III II 
TTCACAGACGTTCCCTCTGCTCTCTTTAGAGCCTGGAGCCGGGGGTAGTGTGACTGTAAC 

1681 + + + + + + 1740 

AAGTGTCTGCAAGGGAGACGAGAGAAATCTCGGACCTCGGCCCCCATCACACTGACATTG 



Maelll 
Maelll Taal 
Tsp45I Bpml | 

I 1 1 



BsmFI 



Mspl 
BsaWI | 
BsrFI | 
PinAI | 
II 



Alul 
CviJI 
Bsp24I 
Cjel 
CjePI 
Tsp509I | 
Muni | | 

Tsp5091 | | 

I I I 



Bpml BslI 

I II I I 

TGCTGGAGATTTCCTACCGGTAAGTCCCCATTATGGTTTTCAAGGCAATTGGAAATTAGC 

1741 + + + + + + 1800 

ACGACCTCTAAAGGATGGCCATTCAGGGGTAATACCAAAAGTTCCGTTAACCTTTAATCG 



Apol 
Cjel 
EcoRI 
Tsp509I 
CjePI | 

CjePI Bsp24l|| CjePI Bfal 

Mmel AlwNI BsrI |MboII |j| Tsp509I | CviJI | 

I I I I I III I I I I 

TTGGACAGGAACTGGAAACAAAGTTGGAGAATTCTTCTGGGATAAAATAAATTATAAGCC 

1801 + + + + + + 1860 

AACCTGTCCTTGACCTTTGTTTCAACCTCTTAAGAAGACCCTATTTTATTTAATATTCGG 



Alol HaelV 
Apol | Bsml Hin4I 

Tsp509I j RleAI Sfcl| Alwl | 

III II I I 

TAGACCTGAAAAAGAAGGAAATTTAGTTCCTAATATCTTGTGGGGGAATGCTGTAGATGT 

1861 + + + + + + 1920 

ATCTGGACTTTTTCTTCCTTTAAATCAAGGATTATAGAACACCCCCTTACGACATCTACA 
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Figure 6H 



BspMI 
Dpnl 
BstYI | 
Sau3AI | 
Hpyl8 8IX| 
II 



1921 



SfaNI 
Alul 
CviJT 
TaqI 
Nsil | 
CviRI | | 

Msel I I I I Nlalll | | 

II I I I I III 

CAGATCCTTAATGCAGGTTCAAGAGACCCATGCATCGAGCTTACAGACAGATCGAGGGCT 

+ + + + + + 1980 

GTCTAGGAATTACGTCCAAGTTCTCTGGGTACGTAGCTCGAATGTCTGTCTAGCTCCCGA 



SimI 
Hpyl7 8III 

Bsal 
BsmAI 
CviRI | 

I I 



TaqI 
Dpnl | 
Sau3AI | | 
Mnll | | | CviJI 
I I II I 



Apol 
Tsp509I 
MboII 
Tsp509I 
Clal 
TaqI 
Dpnl | Alwl 
Sau3AI | | Bed | 
I II II 



Nlaiv 
Rsal 

Bbsl BanI | 

|XmnI Nlalll Hpyl88IX Mnll |MboIl| | 

I I I I I II I 

GTGGATCGATGGAATTGGGAATTTCTTCCATGTATCTGCCTCCGAAGACAATATAAGGTA 

1981 + + + + + + 2040 

CACCTAGCTACCTTAACCCTTAAAGAAGGTACATAGACGGAGGCTTCTGTTATATTCCAT 



Taal Acil Dpnl BpulOI 

Kpnl | MspAlI Sau3AI | Ddel 

ll l III 

CCGTCATAACAGCGGTGGATATGTTCTATCTGTAAATAATGAGATCACACCTAAGCACTA 

2041 + + + - + + + 2100 

GGCAGTATTGTCGCCACCTATACAAGATAGACATTTATTACTCTAGTGTGGATTCGTGAT 



Bed 
TaqI | 



BsmAI 



Acil 



I 



TACTTCGATGGCATTTTCCCAACTCTTTAGTAGAGACAAGGACTATGCGGTTTCCAACAA 

2101 + + + + + + 2160 

ATGAAGCTACCGTAAAAGGGTTGAGAAATCATCTCTGTTCCTGATACGCCAAAGGTTGTT 



Alwl 



Hin4I 



Dpnl 
Sau3AI | 
Mmel | j 



I 



I I I I 



BslI 
Bfal 
Avrll | 
BsaJI | 
Styl| 

II 



XmnI 
Sspl | 
Mnll | | 

I II 



CGAATACAGAATGTATTTAGGATCGTATCTCTATCAATATACAACCTCCCTAGGGAATAT 

2161 + + + + + + 2220 

GCTTATGTCTTACATAAATCCTAGCATAGAGATAGTTATATGTTGGAGGGATCCCTTATA 
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figure 61 



J 



Hinf I 
Tf il 
Hpyl78III | 
Maell | j MboII 
Maelll Bce83I | j |Hpyl78IIl| 

Thai | Sthl32I | | | j Smll || 

II I I I I I III 

TTTCCGTTATGCTTCGCGTAACCCTAATGTAAACGTCGGGATTCTCTCAAGAAGGTTTCT 

2221 + + + + + + 2280 

AAAGGCAATACGAAGCGCATTGGGATTACATTTGCAGCCCTAAGAGAGTTCTTCCAAAGA 



Mnll 



Nlalll 



TCAAAATCCTCTTATGATTTTTCATTTTTTGTGTGCTTATGGTCATGCCACCAATGATAT 

2281 + + + + + + 2340 

AGTTTTAGGAGAATACTAAAAAGTAAAAAACACACGAATACCAGTACGGTGGTTACTATA 



HphI 
Alul | 
Cvi JI | 
MspAlI | 

Apol Pvullj Muni Sfcl 

Tsp509I Cjel|j Tsp509I Cvi JI | 

I III I II 

GAAAACAGACTACGCAAATTTCCCTATGGTGAAAAACAGCTGGAGAAACAATTGTTGGGC 

2341 + + + + + + 2400 

CTTTTGTCTGATGCGTTTAAAGGGATACCACTTTTTGTCGACCTCTTTGTTAACAACCCG 



Nlalll 
Nspl 
Mwol SphI 
Mnll Cjel |Cac8I | 
Bpml |AciI | |Hin4I j Mnll BplI 

I I I I I I I I I 

TATAGAGTGCGGAGGGAGCATGCCTCTATTGGTATTTGAGAACGGAAGACTTTTCCAAGG 

2401 + + + + + + 2460 

ATATCTCACGCCTCCCTCGTACGGAGATAACCATAAACTCTTGCCTTCTGAAAAGGTTCC 



BanI 
MboII 
BsaJI 
Styl 
Bbsl | 
Fokl I 



Bsp24I 
Cjel 
CjePI 

Bed BsmAI | 

NlalV | Tsp509I Nlalll BsmBI j Bcefl 

II I I I I I 

TGCCATCCCATTTATGAAACTACAATTAGTTTATGCTTATCATGGAGATTTCAAAGAGAC 

2461 + + + + + + 2520 

ACGGTAGGGTAAATACTTTGATGTTAATCAAATACGAATAGTACCTCTAAAGTTTCTCTG 
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Figure 6J 



CviJI 
Haelll 
Bed 
Eael 
Gdill 
PstI | 
CviRI | j 
Sfcl | | | 
I I I 



Cjel 
CjePI | 
Bsp24I | j 

III 



Msel 

I 



Clal 
TaqI 

I 



Rsal 
I 



Bfal 



GACTGCAGATGGCCGTAGATTTAGTAATGGGAGTTTAACATCGATTTCTGTACCTCTAGG 

2521 + + + + + + 

CTGACGTCTACCGGCATCTAAATCATTACCCTCAAATTGTAGCTAAAGACATGGAGATCC 



2580 



Fokl 



Mnll 
I 



Cac8I BseMII 

Alul | Hpyl78III Rsal | 

CviJI | Ddel |TatI | j 

II I I I I I 



CATACGCTTTGAGAAGCTGGCACTTTCTCAGGATGTACTCTATGACTTTAGTTTCTCCTA 

2581 + + + + + + 2640 

GTATGCGAAACTCTTCGACCGTGAAAGAGTCCTACATGAGATACTGAAATCAAAGAGGAT 



Bbvl 
Dpnl 
NlalV 
BamHI 
BstYI 
Sau3AI 
Alwl | 

I I 



Fnu4HI 
Alul | 
CviJI | 
Nlalll Tsel| 
Hpyl78III Alwl | || Alwl |Mnll| 

I I I II I I II 

TATTCCTGATATTTTCCGTAAGGATCCCTCATGTGAAGCTGCTCTGGTGATTAGCGGAGA 

2641 + + + + + + 2700 

ATAAGGACTATAAAAGGCATTCCTAGGGAGTACACTTCGACGAGACCACTAATCGCCTCT 



Acil 
Plel | Hinf I 
BsmAI | |HphI | 

I II II 



CviJI 
ScrFI | 
EcoRII I 



Hpyl78III 
BsaAI 
Mae 1 1 



Af 1III 
Fnu4HI | 
Tsel| j 
Mspl | | | 

III I 



Bbvl 



Nlalll 
Nspl 
I 



Sthl32I 



SimI 
BscGI | 
II 



CTCCTGGCTTGTTCCGGCAGCACACGTATCAAGACATGCTTTTGTAGGGAGTGGAACGGG 

2701 + + + + + + 2760 

GAGGACCGAACAAGGCCGTCGTGTGCATAGTTCTGTACGAAAACATCCCTCACCTTGCCC 



CO 



Figure 6K 



BseMII 
Msel I 
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Mnll 
Banll 
BsiHKAI 
Bspl286I 
SacI 
Alul | 
CviJI | 
Ddel | | | Taql 
I I I I I 



BsmI 
Acil | 
Fnu4HI | 
Taul j 
II 



TCGGTATCACTTTAACGACTATACTGAGCTCTTATGTCGAGGAAGTATAGAATGCCGCCC 

2761 + + + + + + 2820 

AGCCATAGTGAAATTGCTGATATGACTCGAGAATACAGCTCCTTCATATCTTACGGCGGG 



Tsp509I 
Bfal I 
BslI j 
NlaIIl| | 



Apol 
Tsp509I 



BsrDI 



Taal Tsp509I Cjel 

I II II 

CCATGCTAGGAATTATAATATAAACTGTGGAAGCAAATTTCGTTTTTAGAAGGTTTCCAT 

2821 + + + + + + 2880 

GGTACGATCCTTAATATTATATTTGACACCTTCGTTTAAAGCAAAAATCTTCCAAAGGTA 



2881 



Alwl 
Cjel 
Msel 
Dpnl 
BstYI 
Sau3AI 
Hpyl78III | 
Mspl | j 

BsaWI III | || | Dpnl 
BspEI III | || | BspGI Sau3AI | 

Nlaiv| | | | | III ScrFI |HaeIV | | 

Xcml Drdll| | | | | | III EcoRII | |Hin4I | | Alwl 

I Mill II Ml III I 

TGCCTGTGTGGTTCCGGATCTTAACTATAAATCCTGGACTATGGATCATAGGCATTGGGT 

+ + + + + + 2940 

ACGGACACACCAAGGCCTAGAATTGATATTTAGGACCTGATACCTAGTATCCGTAACCCA 



Hpyl78III 
Taql 
I 

TTCTCGAACT 

2941 + 2950 

AAGAGCTTGA 
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Figure 7A 

Restriction enzyme analysis of CPN100325 (RY 62 



SEQ ID NO. 7) 



BsiEI 
Pvul 
Dpnl | 
Sau3AI | | 

BsrDI TaqI | | | 

I II II 

GTGGGGGCATTGCTGGGGGAAAAGCACATTTCGATCGCATTGATAATCTTATCAGTCCAA 

CACCCCCGTAACGACCCCCTTTTCGTGTAAAGCTAGCGTAACTATTAGAATAGTCAGGTT 



60 



61 



BslI 
EcoNI 
Mnll 

CjePI ScrFI 
Hpyl78III | Bsll| 
Fokl | | EcoRII | | 

CjePI Tthlllll SfaNI || j MboIl||| 

I I I II I I I I I 

AGCAACCAAGCAAAGAAAGGTGGTGGGGTTTATCTTGAAGATGCCCTCATCCTGGAAAAG 

TCGTTGGTTCGTTTCTTTCCACCACCCCAAATAGAACTTCTACGGGAGTAGGACCTTTTC 



120 



Sf CI 
Alul | 
Cvi JI | 

Fnu4HI | | BpulOI 
Cjel III Ddel 
AlwNI BsmAI | Tsel | || Bbvl Cjel| 

I I I II II I II 

GTTATTACAGGTTCTGTCTCACAAAATAGCAGCTACAGAAAGTGGTGGGGGTATCTACGC 

121 + + + + + + 

CAATAATGTCCAAGACAGAGTGTTTTATCGTCGATGTCTTTCACCACCCCCATAGATGCG 



180 



Tsp509I 
Alul 
CviJI 
Hindlll 
ScrFI 
EcoRII | 
Alul | | 

CviJI || || | TaqI 

I I I 

TAAGGATATTCAACTACAAGCTCTACCTGGAAGCTTCACAATTACCGATAATAAAGTCGA 

181 + + + + + + 240 

ATTCCTATAAGTTGATGTTCGAGATGGACCTTCGAAGTGTTAATGGCTATTATTTCAGCT 
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Figure 7B 



Maelll 
Tsp4 5I 
Alul | 

Bfal Tsp509I Bsrl CviJI | 

Spel| Bfal | Acelll SfaNI TspRI | j 

II II I I III 

AACTAGTCTTACTACTAGCACTAATTTATATGGTGGGGGCATCTATTCCAGTGGAGCTGT 

241 + + + + + + 

TTGATCAGAATGATGATCGTGATTAAATATACCACCCCCGTAGATAAGGTCACCTCGACA 



300 



NgoGV 
NlalV 

Hpyl78III | Fokl 

I I I 
CACGCTAACCAATATATCTGGAACCTTTGGCATTACAGGAAACTCTGTTATCAATACAGC 

301 + + + + + + 360 

GTGCGATTGGTTATATAGACCTTGGAAACCGTAATGTCCTTTGAGACAATAGTTATGTCG 



ScrFI 
BsaJI | 
EcoRII | 
SfaNI | | 

I I I 



361 



Btrl BsmAI 

CviRI Fokl CviRI Maell | BsmBI 

II I II I 

GACATCCCAGGATGCAGATATACAAGGTGGGGGCATTTATGCAACCACGTCTCTCTCAAT 

CTGTAGGGTCCTACGTCTATATGTTCCACCCCCGTAAATACGTTGGTGCAGAGAGAGTTA 



Taqll Fnu4HI 
Bbvl | Tsel | 

II II 

AAATCAATGTAATACACCCATTCTATTTAGCAACAACTCTGCTGCCACTAAAAAAACATC 

421 + + + + + + 480 

TTTAGTTACATTATGTGGGTAAGATAAATCGTTGTTGAGACGACGGTGATTTTTTTGTAG 



Tsp509I 



CviJI 
Bbvl | 
Mwol | | 
MboII | | | 
I I I I 



Maelll 
PstI 
CviRI 
Fnu4HI 
Sfcl 
MspAlI | 
Tsel | 
Acil | | 
I II 



Hin4I 
Hpyl78III | 
Taql | | 
I I I 



AACAACAAAGCAAATTGCTGGTGGGGCTATCTTCTCCGCTGCAGTAACTATCGAGAATAA 

481 + + + + + + 540 

TTGTTGTTTCGTTTAACGACCACCCCGATAGAAGAGGCGACGTCATTGATAGCTCTTATT 




gure 7C 
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CviJI 
Ddel | 



Tsp509I 
Msel 
Mmel | 
BseMIlj j 
Bpll|| | 
III I 



Sfcl 
AlwNI 
BstAPI 
Fnu4HI | 
Tsel| | 
Mwol | | | 
Sfcl | | | 
I I II 



Mwol 



541 



Acil Hpyl8 8IX 

II III I I I I 

CTCTCAGCCCATTATTTTCTTAAATAATTCCGCAAAGTCGGAAGCAACTACAGCAGCAAC 

GAGAGTCGGGTAATAAAAGAATTTATTAAGGCGTTTCAGCCTTCGTTGATGTCGTCGTTG 



600 



BseRI 
Alul [ 
CviJI 
Fnu4HI 

BsrDI 

Alul CviJI | CviRI 

CviJI NgoGV | | Mwol | | | Bbvl 

Mnll| Nlaivj j Tselj || Maelll | Msel 

II III II II II I 

TGCAGGAAATAAAGATAGCTGTGGAGGAGCCATTGCAGCTAACTCTGTTACTTTAACAAA 

ACGTCCTTTATTTCTATCGACACCTCCTCGGTAACGTCGATTGAGACAATGAAATTGTTT 



Bbvl 
PstI | 
CviRI | | 

I II 



601 



660 



Tsp509I 
Dral | 
Msel | j 
II I 



AlwNI 
Mnll | 
CviRI | j BsrI 
III I 



Bpml 
BseRI | 
CviJI | | 

I I I 



TAACCCTGAAATAACCTTTAAAGGAAATTATGCAGAAACTGGAGGAGCGATTGGCTGTAT 

661 + + + + + + 720 

ATTGGGACTTTATTGGAAATTTCCTTTAATACGTCTTTGACCTCCTCGCTAACCGACATA 

Dpnl Sthl32I CviRI 

Sau3AI | HphI CviJI BscGI Mnll | BsmAI | Taal 

I I I I I II I I I 

TGATCTTACTAATGGCTCACCTCCCCGTAAAGTCTCTATTGCAGACAACGGTTCTGTCCT 

721 + + + + + + 780 

ACTAGAATGATTACCGAGTGGAGGGGCATTTCAGAGATAACGTCTGTTGCCAAGACAGGA 



Hpyl78III 
I 



Acil 
Mnll | 
Msel | Thai 
I I I 



Haell 

Hhal| Hin4I 
Hin4I | | BsmAI Bpml [ 

III I II 



ECORV 
Cjel | 
Clal j 
Taql | 

I I 



TTTTCAAGACAACTCTGCGTTAAATCGCGGAGGCGCTATCTATGGAGAGACTATCGATAT 
AAAAGTTCTGTTGAGACGCAATTTAGCGCCTCCGCGATAGATACCTCTCTGATAGCTATA 



840 
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Figure 7D 



Cjel 

ScrFI Maelll | Bed Tsp509I 

EcoRII | MboII | | Earl Nlalll | CviRI | 

II III I II II 

CTCCAGGACAGGTGCGACTTTCATCGGTAACTCTTCAAAACATGATGGAAGTGCAATTTG 

841 + + + + + + 

GAGGTCCTGTCCACGCTGAAAGTAGCCATTGAGAAGTTTTGTACTACCTTCACGTTAAAC 



900 



CviJI Hhal Maelll 

I I I 

CTGTTCAACAGCCCTAACTCTTGCGCCAAACTCCCAACTTATCTTTGAAAACAATAAGGT 

901 + + + + + + 960 

GACAAGTTGTCGGGATTGAGAACGCGGTTTGAGGGTTGAATAGAAACTTTTGTTATTCCA 



Tsp509I 
CviRI 
Fnu4HI 
Alul | 
CviJI | 
Tsel | 
I I 



CviJI 



Alul 
CviJI 
Hindlll | 
I I 



Tsp509I 
Bbvl | 
AceIIl| | 
I I I 



TACGGAAACCACAGCCACTACAAAAGCTTCCATAAATAATTTAGGAGCTGCAATTTATGG 

961 + + + + + + 1020 

ATGCCTTTGGTGTCGGTGATGTTTTCGAAGGTATTTATTAAATCCTCGACGTTAAATACC 



BsmAI 
I 



Aatll 
Maelll 
Tsp4 5I 
BsaHI 
Mae 1 1 
Maelll 
Tsp45I 
Bf al | 
Spel| | 
II I 



BseMII 
I 



Ddel 
Alul | 
CviJI | 
MspAlI | 
PvuII 



Msel 
I 



AAATAATGAGACTAGTGACGTCACTATCTCTTTATCAGCTGAGAATGGAAGTATTTTCTT 

1021 + + + + + + 1080 

TTTATTACTCTGATCACTGCAGTGATAGAGAAATAGTCGACTCTTACCTTCATAAAAGAA 



Dral 



CviRI 



Tthlllll 
PstI | 
CviRI | | 
Sfcl | | | 
I I I I 



Eco57I 
Apol | 
Tsp509I | 
Maell | | 

I I I 



TAAAAACAATCTATGCACAGCAACAAACAAATACTGCAGTATTGCTGGAAACGTAAAATT 

1081 + + + + + + 1140 

ATTTTTGTTAGATACGTGTCGTTGTTTGTTTATGACGTCATAACGACCTTTGCATTTTAA 
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Figure 7E 



Alul 
CviJI 
Hindlll | 
Mwol | | 

II I 



Alul 
CviJI 
Mwol | Sf aNI 

I I I 



Acll 

Mae 1 1 
Hindi 
Hpal 
Msel | 
CviRI | | 

I II 



1 



c 



TACAGCAATAGAAGCTTCAGCAGGGAAAGCTATATCTTTCTATGATGCAGTTAACGTTCC 

1141 + + + + + + 1200 

ATGTCGTTATCTTCGAAGTCGTCCCTTTCGATATAGAAAGATACTACGTCAATTGCAAGG 



Msel 
Tsp509I 
Alul 
CviJI 
Hpyl78III | 
Muni | | 

Bce83I Tsp509I Smll | | 



I 



Rsal 
Tat I | Mae I I 

I I I 



ACCAAAGAAACAATTGCTCAAGAGCTAAATTAAATGAAAAAGCGACAAGTACANGGACGT 

1201 + + + + + + 1260 

TGGTTTCTTTGTTAACGAGTTCTCGATTTAATTTACTTTTTCGCTGTTCATGTNCCTGCA 

Bsp24I 
CjePI 

Cjel| Maelll 
BsmFI | | Tsp45I BsaJI 

I II II 
TTCTANTTTCTGGGGGACTTCACGGAAATAAATCCCTATTCCACAGAAAGTCACTTCGCC 

1261 + + + + + + 1320 

AAGATNAAAGACCCCCTGAAGTGCCTTTATTTAGGGATAAGGTGTCTTTCAGTGAAGCGG 



Cjel 
CjePI 
Bsp24I | 
II 

CTNGGGAT 

1321 1328 

GANCCCTA 
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Restriction enzyme analysis of CPN100368 (RY 63 - SEQ ID NO. 8) 

Cjel 
Nlalll 
BsiHKAI 
Bspl286I 
BseSI | 
CviRI j 

Msel Taal ApaLI | j 

II III 
TTACTTGATTTATTTAACTGTATTCTCTATTGGTGCACCATGCTCCTAAAGCCACATGCT 

AATGAACTAAATAAATTGACATAAGAGATAACCACGTGGTACGAGGATTTCGGTGTACGA 



CviJI Nlalll 
Mwol | Nspl 
I I I 



Alul 
CviJI 

Hindlll | Sspl BsaJI 

Cjel | | Nlalll | Styl 

III II I 
ATGGGAGTATTTTTGATAAAAAGCTTTTCCCCAAAGACACATGAAATATTCTTTACCTTG 
61 + + + + + + 120 

TACCCTCATAAAAACTATTTTTCGAAAAGGGGTTTCTGTGTACTTTATAAGAAATGGAAC 



Fokl Dpnl 

Mnll | Fnu4HI BstYI | 

MboII CviJI | j CviJI | Sau3AI | 

CviJI | Earl | | | Bbvl Tsel | Fokl | j 

II II I I I II III 

GCTACTTACCTCTTCGGCTTTAGTTTTCTCCCTACATCCACTAATGGCTGCTAACACGGA 

CGATGAATGGAGAAGCCGAAATCAAAAGAGGGATGTAGGTGATTACCGACGATTGTGCCT 



180 



Hpyl8 8IX 
Alwl | 
HaelV | j 
Hin4I | j 
I I I 



Eco57I 
TspRI 
BsaJI 
Styl 
BsmI Bbvl 
Fnu4HI | BtsI | 
Hhal | | BstAPI | j 
Tsel | j Mwol j j 
III III 



TCTCTCATCATCCGATAACTATGAAAATGGTAGTAGTGGTAGCGCAGCATTCACTGCCAA 

181 + + + + + + 240 

AGAGAGTAGTAGGCTATTGATACTTTTACCATCATCACCATCGCGTCGTAAGTGACGGTT 



Hpyl88IX Fokl 
SfaNI | Hpyl78III | Bfal Bael 

II II II 

GGAAACTTCGGATGCTTCAGGAACTACCTACACTCTCACTAGCGATGTTTCTATTACGAA 

241 + + + + + + 300 

CCTTTGAAGCCTACGAAGTCCTTGATGGATGTGAGAGTGATCGCTACAAAGATAATGCTT 
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Figure 8B 



PstI Acelll 

CviRI | Tthlllll | 

Tsp509I Bael | | Alul CjePI | | 

CviRI | Sfcl | | CviJI Mnll Mmel | | j 

II III I I I I I I 

TGTATCTGCAATTACTCCTGCAGATAAAAGCTGTTTTACAAACACAGGAGGAGCATTGAG 

301 + + + + + + 

ACATAGACGTTAATGAGGACGTCTATTTTCGACAAAATGTTTGTGTCCTCCTCGTAACTC 



360 



361 



Bbvl 
Haell 
Hhal | 
Eco47III | | 
Cjel Ml 
I I III 

TTTTGTTGGAGCTGATCACTCATTGGTTCTGCAAACCATAGCGCTTACGCATGATGGTGC 
AAAACAACCTCGACTAGTGAGTAACCAAGACGTTTGGTATCGCGAATGCGTACTACCACG 



BseRI 



I 



Dpnl 
Bell 
Sau3AI 
Alul | 
CviJI | 

I I 



Drdll 
CjePI | CviRI 



Fnu4HI 
Bed | 
Msll| | 
Nlalll | |TseI | 
III II 



420 



Msel 
Plel 
Bpml 
BseMII 
Maelll 
Tsp4 5I 
CjePI 
Hinf I 



CjePI 
Msel | 
Tsp509I | | 
CviRI | I | Cjel 
II II I 



Tf il 

Alul Hpyl78III | 

CviJI Ddel | j 

Bsbl | Acelll| | | 
II II I I 



Hinf I 
Taql | 
I I 



TGCAATTAACAATACCAACACAGCTCTTTCTTTCTCAGGATTCTCGTCACTCTTAATCGA 

421 + + + + + + 

ACGTTAATTGTTATGGTTGTGTCGAGAAAGAAAGAGTCCTAAGAGCAGTGAGAATTAGCT 



480 



Alul 
CviJI 
Ddel | 
I I 



Acelll 
BseMII 
Sthl32I 
BslI | 
II 



Fnu4HI 
Taul 
Acil | 
I I 



Maelll 
Tsp45I 



Mnll 
Mnll | 
I I 



CTCAGCTCCAGCAACAGGAACTTCGGGCGGCAAGGGTGCTATTTGTGTGACAAATACAGA 

481 + + + + + + 540 

GAGTCGAGGTCGTTGTCCTTGAAGCCCGCCGTTCCCACGATAAACACACTGTTTATGTCT 
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Figure 8C 



Eco57I 
TspRI 
Maelll 
Tsp45I 
BsrI | 

Rsal HphI | | 

I III 
GGGAGGTACTGCGACTTTTACTGACAATGCCAGTGTCACCCTCCAAAAAAATACTTCAGA 

541 + + + + + + 60 

CCCTCCATGACGCTGAAAATGACTGTTACGGTCACAGTGGGAGGTTTTTTTATGAAGTCT 



Bbvl 
Acelll | 
Hpyl88IX| 
Mnll | j 

I il 



601 



AlwNI 
BstAPI 
PstI 
CviRI | 
BsaXl| | 

Fnu4HI Ml j Ddel 

Sf cl III | SfaNI | Alul 

Alul | | I I | Dpnl | | Cvi JI 

Cvi JI | | | | | Sau3AI | j | Fnu4HI | 

Tsel | | | | j Clal | | | | Hin4I | j 

Bed Mil |MwoI Sfcl Taqlj j j |Pflll08I Tsel j j 

I I I I I III II I I I I III 

AAAAGATGGAGCTGCAGTTTCTGCCTACAGCATCGATCTTGCTAAGACTACGACAGCAGC 

+ + H + + + 

TTTTCTACCTCGACGTCAAAGACGGATGTCGTAGCTAGAACGATTCTGATGCTGTCGTCG 



66 



Sfcl 
Apal 
Banll 
Bspl286I 
Bmgl 
BseSI 
CviJI 
Haelll 
NgoGV 
NlalV 
Eco0109I | 
NgoGV j 
Nlaivj 
Sau96l| j j Mnll 
EcoO109I III j Rsal | 

Sau96I | j j |CjeI | | Sfcl 
Acil Ml | |TatI | j CjePI | 

I III I I III II 
TCTCTTAGATCAAAATACTAGCACAAAAAATGGCGGGGCCCTCTGTAGTACAGCAAACAC 

661 + + + + + + 72 

AGAGAATCTAGTTTTATGATCGTGTTTTTTACCGCCCCGGGAGACATCATGTCGTTTGTG 



Acelll 
Dpnl 

Bbvl | Faul 
Sau3AI | | Sthl32I | 

Ddel | | | Cjel Bfal | | 

I III I I II 



\ 



Figure 8D 
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CjePI 
BseMII 
BseRI 

Tthlllll BstEII 
BsaJI | Maelll 
Styl |Hpyl78III Taal 
Taal | | Ddel |Tsp45I 
III II I 



HphI 



Sfcl 
Mnll | 

I I 



721 



TACAGTCCAAGGAAACTCAGGAACGGTGACCTTCTCCTCAAATACTGCTACAGATAAAGG 
ATGTCAGGTTCCTTTGAGTCCTTGCCACTGGAAGAGGAGTTTATGACGATGTCTATTTCC 



780 



Dpnl Bfal 
BstYI | Cac8I | Maelll 

Sau3AI | Alwl SfaNI | j Hinfl | Plel 

III III III 

TGGGGGGATCTACTCAAAAGAAAAGGATAGCACGCTAGATGCCAATACAGGAGTCGTTAC 

ACCCCCCTAGATGAGTTTTCTTTTCCTATCGTGCGATCTACGGTTATGTCCTCAGCAATG 



840 



Hpyl8 8IX 

Banll 
BsiHKAI 

BscGI Bspl286I 
Tthlllll | Sad 
CviRI | | Alul | 

Sthl32I | | j CviJI | | BsaBI Bce83I 

I I I I I II I I 

CTTCAAATCTAATACTGCAAAGACGGGGGGTGCTTGGAGCTCTGATGACAATCTTGCTCT 

841 + + + + + + 900 

GAAGTTTAGATTATGACGTTTCTGCCCCCCACGAACCTCGAGACTACTGTTAGAACGAGA 



Rsal 

Mspl Smll Seal 

BsrFl| Bsbl |TatI |Hpyl78III 
II I I I I I 



Fnu4HI 
Bpull02I 
Ddel 
CviJI | 
Mspl | | 
BsrFI | | |TseI 
I I I I I 



BseMII 
Mwol | 



TACCGGCAACACTCAAGTACTTTTTCAGGAAAATAAAACAACCGGCTCAGCAGCACAGGC 
ATGGCCGTTGTGAGTTCATGAAAAAGTCCTTTTATTTTGTTGGCCGAGTCGTCGTGTCCG 



960 



Sthl32I 
Mspl | 
Neil j 

Bbvl ScrFI j Sfcl 

III I 
AAATAACCCGGAAGGTTGTGGTGGGGCAATCTGTTGTTATCTTGCTACAGCAACAGACAA 

961 + + + + + + 1020 

TTTATTGGGCCTTCCAACACCACCCCGTTAGACAACAATAGAACGATGTCGTTGTCTGTT 




Figure 
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Alul 
CviJI 
BseMII 
Hpyl78III 
Hinfl 



Tf il 

CviJI Hpyl88IX | 
BsrI | Ddel | | 

II III 



Bfal 
Spel | Cjel 
I I I 



AACTGGATTAGCCATTTCTCAGAATCAAGAAATGAGCTTCACTAGTAATACAACAACTGC 

1021 + + + + + + 1080 

TTGACCTAATCGGTAAAGAGTCTTAGTTCTTTACTCGAAGTGATCATTATGTTGTTGACG 



Cjel 
Cjel | 

Mwol | j Hpyl78III 
Dpnl | j j Rsal | TaqI 

Sau3AI | j j j TatI | | Bed Fokl Cjel | 

I I I I I I I I I I II 

GAATGGTGGAGCGATCTACGCTACTAAATGTACTCTGGATGGAAACACAACTCTTACCTT 

1081 + + + + + + 1140 

CTTACCACCTCGCTAGATGCGATGATTTACATGAGACCTACCTTTGTGTTGAGAATGGAA 



Hpyl88IX 
Dpnl | 
Sau3AI | | 
I I I 



Fokl 
Alul 
CviJI 
Ecil | 
Acil| j 
II I 



AlwNI 



CGATCAGAATACTGCGACAGCAGGATGTGGCGGAGCTATCTATACAGAAACTGAAGATTT 

1141 + + + + + + 1200 

GCTAGTCTTATGACGCTGTCGTCCTACACCGCCTCGATAGATATGTCTTTGACTTCTAAA 



Maelll 
Taal 
Tsp45I 
NgoGV | 
NlalV j 
BscGI | j 
Eco57I I 



Eco57I I I I I BsaHI 

Sthl32I | I I I I Narl 

Msel || I I I I BanI | 

Af III III I I I I Fnu4HI I j 

MboII III I I I I Taul | | 

Smll j | | Rsal | j | Acil | j j 

II I I I I I I I I I I 

TTCTCTTAAGGGAAGTACGGGAACCGTGACCTTCAGCACAAATACAGCAAAGACAGGCGG 

1201 + + + + + + 1260 



AAGAGAATTCCCTTCATGCCCTTGGCACTGGAAGTCGTGTTTATGTCGTTTCTGTCCGCC 



Figure 8F 

Haell 
Hhal | 
NgoGV| | 
Nlaivj | 
III 
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Cac8I 
Alul | 
CviJI | 



1261 



BsrI 

Acelll | BspMI 

I I I 
CGCCTTATATTCTAAAGGAAACAGCTCGCTGACTGGAAATACCAACCTGCTCTTTTCAGG 

+ + + + + + 1320 

GCGGAATATAAGATTTCCTTTGTCGAGCGACTGACCTTTATGGTTGGACGAGAAAAGTCC 



Sthl32I 
Tsp509I 
MboII 
Apal 
Banll 
Bspl286I 
Aval 
Bmgl 
BseSI 
CviJI 
Haelll 
NgoGV 
NlalV 
Sau96I 
BscGI 
Sau96I 
Eco57I 
Alul | 

CviJI I I I I I I I M Hpyl78III 
Sthl32I | I | I I | I | || Mnll | Acil 

I I lllllll II III 

GAACAAAGCTACGGGCCCGAGTAATTCTTCAGCAAATCAAGAGGGTTGCGGTGGGGCAAT 

1321 + + + + + + 1380 

CTTGTTTCGATGCCCGGGCTCATTAAGAAGTCGTTTAGTTCTCCCAACGCCACCCCGTTA 



CviJI 
Bf al | 
Mwol 



Dpnl 
NgoGV 
NlalV 
BamHI 
BstYI 
Sau3AI 
Hpyl7 8III 
HaelV 
Hin4I 
Alwl | 
Hinfl| j 
Tfilj | 
I I I 



1381 



Clal 

Alwl TaqI CviRI 

I I I 
CCTAGCCTTTATTGATTCAGGATCCGTAAGCGATAAAACAGGACTATCGATTGCAAACAA 
+ + + + + + 

GGATCGGAAATAACTAAGTCCTAGGCATTCGCTATTTTGTCCTGATAGCTAACGTTTGTT 



1440 
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J 



Figure 



CviRI 
Fnu4HI 

Bbvl Tsel | 

CviJI | Bf al Mnll | j 
Tthlllll ||Spel|Cjel| || 
I II II II II 



Taal 



Cjel 
Cjel 
Mwol 
CjePI | 
Dpnl | j 
Sau3AI | | | 
I I II 



CCAAGAAGTCAGCCTCACTAGTAATGCTGCAACAGTAAGTGGTGGTGCGATCTATGCTAC 

1441 + + + + + + 1500 

GGTTCTTCAGTCGGAGTGATCATTACGACGTTGTCATTCACCACCACGCTAGATACGATG 



CjePI 
NgoGV 

NlalV Eco57I 
Rsal CviJI | Beef I | Mnll 

TatI | BsrI || Cjel | | Beef I | 

II III III II 

CAAATGTACTCTAACTGGAAACGGCTCCCTGACCTTTGACGGCAATACTGCTGGAACTTC 

1501 + + + + + + 1560 

GTTTACATGAGATTGACCTTTGCCGAGGGACTGGAAACTGCCGTTATGACGACCTTGAAG 



Maelll 
Eco57I Taal 

Dpnl Rsal Tsp45I 

Sau3AI | TatI | NgoGV | 

Hpyl78III Hin4I | j AlwNI MboII Eco57I | j NlalV j 

I I I I I I Ml II 

AGGAGGGGCGATCTATACAGAAACTGAAGATTTTACTCTTACAGGAAGTACAGGAACCGT 

1561 + + + + + + 

TCCTCCCCGCTAGATATGTCTTTGACTTCTAAAATGAGAATGTCCTTCATGTCCTTGGCA 



1620 



1621 



Haell 
Hhal 
NgoGV 
NlalV 
BsaHI 
Narl 
BanI 
Fnu4HI | 
Taul j 
Acil | | 
I I I 

GACCTTCAGCACAAATACAGCAAAGACAGGCGGCGCCTTATATTCTAAAGGCAACAACTC 
+ + + + + + 

CTGGAAGTCGTGTTTATGTCGTTTCTGTCCGCCGCGGAATATAAGATTTCCGTTGTTGAG 



1680 
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Figure 8H 



Sthl32I 
Tsp509I 
MboII 
Apal 



BspMI Sthl32I 



Banll 
Bspl286I 
Aval 
Bmgl 
BseSI 
CviJI 
Haelll 
NgoGV 
NlalV 
Sau96I 
BscGI 
Sau96I 
Eco57I 
Alul | 
CviJI j 



I 



TCTGTCTGGTAATACCAACCTGCTCTTTTCAGGGAACAAAGCTACGGGCCCGAGTAATTC 

1681 + + + + + + 1740 

AGACAGACCATTATGGTTGGACGAGAAAAGTCCCTTGTTTCGATGCCCGGGCTCATTAAG 



AlwNI 
Plel [ 
Bcgl 
Hin4I 
Hinf I | 

Hpyl78III Acil Hpyl78III | j 

Mnll | Bcgl | Smll | j j 

I I I I II I I 

TTCAGCAAATCAAGAGGGTTGCGGTGGGGCAATCCTATCGTTTCTTGAGTCAGCATCTGT 

1741 + + + + + + 1800 

AAGTCGTTTAGTTCTCCCAACGCCACCCCGTTAGGATAGCAAAGAACTCAGTCGTAGACA 



Bce83I 

Rsal| Hinf I 

Seal | Maell | 

SfaNI || Hpyl78III MboII | | Plel 

TatI j j Plel Hinfl | CjePI || j BsmAl|CjeI 

I II I I I I II I II I 

AAGTACTAAAAAAGGACTCTGGATTGAAGATAACGAAAACGTGAGTCTCTCTGGTAATAC 

1801 + + + + + + 1860 

TTCATGATTTTTTCCTGAGACCTAACTTCTATTGCTTTTGCACTCAGAGAGACCATTATG 
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Figure 81 



CjePI 
CviRI Taal 



Cjel 
Mwol 
CjePI 
Dpnl | 
Sau3AI | | 
Acil | | | 

I III 



CjePI 
Hinf I 
Cjel 



Plel 
Nlalll 
CviRI | 
BsiHKAI | | 
Bspl286I | | 
I I I 



TGCAACAGTAAGTGGCGGTGCGATCTATGCGACCAAGTGTGCTCTGCATGGAAACACGAC 

1861 + + + + + + 1920 

ACGTTGTCATTCACCGCCACGCTAGATACGCTGGTTCACACGAGACGTACCTTTGTGCTG 



Bed 



PstI 
CviRI 
Sfcl 
Cjel| 
Mnll | 
Mwol | 
II 



Dpnl 
Sau3AI | 
Hin4I | | 
Mwol | | I 
I I II 



BseRI 
I 



TCTTACCTTTGATGGCAATACTGCCGAAACTGCAGGAGGAGCGATCTATACAGAAACCGA 

1921 + + + + + + 1980 

AGAATGGAAACTACCGTTATGACGGCTTTGACGTCCTCCTCGCTAGATATGTCTTTGGCT 



Maelll 
Taal 
Tsp4 5I 
NgoGV 
Nlaiv 
BscGI | 
ECOS7I I 



BscGI 
Sthl32I | 
MboII | 



Sthl32I 
I 



II 
I I 



Rsal 



i i 
i i 



Mwol 
I 



AGATTTTACTCTTACGGGAAGTACGGGAACCGTGACCTTCAGCACAAATACAGCAAAGAC 

1981 + + + + + + 2040 

TCTAAAATGAGAATGCCCTTCATGCCCTTGGCACTGGAAGTCGTGTTTATGTCGTTTCTG 

Banll 
Bspl286I 

CviJI | Xmnl CviJI 

I I I I 

AGCAGGGGCTCTACATACTAAAGGAAATACTTCCTTTACCAAAAATAAGGCTCTTGTATT 

2041 + + + + + + 2100 

TCGTCCCCGAGATGTATGATTTCCTTTATGAAGGAAATGGTTTTTATTCCGAGAACATAA 



Hpyl78III 
Dpnl | 
Sau3AI | | 
Sfcl | | j Alwl 

II I I I I I 

TTCTGGAAATTCAGCAACAGCAACAGCAACAACAACTACAGATCAAGAAGGTTGTGGTGG 

2101 + + + + + + 2160 

AAGACCTTTAAGTCGTTGTCGTTGTCGTTGTTGTTGATGTCTAGTTCTTCCAACACCACC 



Apol 
Tsp509I 
Hpyl7 8III 




Figure 8J 



Dpnl 
Hin4I | 
Sau3AI | j 
III 
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Hpyl88IX 
AlwNI | 
BplI | j 
Hinf I j | Alul 
Hpyl88IX | j | CviJI 
Ddel || || BseMII | 
Mnll Ml || Plel | j 
I I I I II I 



Msel 
Alul | 
CviJI | 
Hindlll | j 
I I I 



1 

CO 



ro 

C3 



m 
o 

rn 

< 
m 
a 



2161 



AGCGATCCTCTGTAATATCTCAGAGTCTGACATAGCTACAAAAAGCTTAACTCTTACTGA 

+ + + + 2220 



- + 



TCGCTAGGAGACATTATAGAGTCTCAGACTGTATCGATGTTTTTCGAATTGAGAATGACT 



Msel 



Msel 



Beef I 



AAATGAGAGTTTAAGTTTCATTAACAATACGGCAAAAAGAAGTGGTGGTGGTATTTATGC 

2221 + + + + + + 2280 

TTTACTCTCAAATTCAAAGTAATTGTTATGCCGTTTTTCTTCACCACCACCATAAATACG 

BseMII 
TspRI 
Hinfl | 
Tf il | 

Ddel Ddel BtsI | | | Bed 

I I I I I 

TCCTAAGTGTGTAATCTCAGGCAGTGAATCCATAAACTTTGATGGCAATACTGCTGAAAC 

2281 + + + + + + 2340 

AGGATTCACACATTAGAGTCCGTCACTTAGGTATTTGAAACTACCGTTATGACGACTTTG 



Sthl32I Mnll 
I I 



Avail 

BseRI Sau96I 
NspV| Alul | 

Hpyl78III TaqI | TaqI CviJI Taal BsmAI 

I II I I I I 

TTCGGGAGGAGCGATTTATTCGAAAAACCTTTCGATTACAGCTAACGGTCCTGTCTCCTT 

2341 + + + + + + 2400 

AAGCCCTCCTCGCTAAATAAGCTTTTTGGAAAGCTAATGTCGATTGCCAGGACAGAGGAA 



Bpml 
Haell 
Hhal 
NgoGV 
Nlaiv 
BsaHI 

Hpyl78III Narl 
Mnll | Banl| 
Tsp509l| |MnlI Mwol ||||| | CviJI Acil Mnll 

II I I II 

TACCAATAATTCTGGAGGCAAGGGAGGCGCCATTTATATAGCCGATAGCGGAGAACTTTC 

2401 + + + + + + 2460 

ATGGTTATTAAGACCTCCGTTCCCTCCGCGGTAAATATATCGGCTATCGCCTCTTGAAAG 



JC 
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Figure 8K 



Ddel 



Bed 



Ddel 



BseMII Ddel 
NgoGV|BseMII | 
NlaIV|MnlI | | 
II III 



BsaXI 
Alol | 
Ppil| 
II 



CviJI 

III I 
CTTAGAGGCTATTGATGGGGATATTACTTTCTCAGGGAACCGAGCGACTGAGGGAACTTC 

2461 + + + + + + 2520 

GAATCTCCGATAACTACCCCTATAATGAAAGAGTCCCTTGGCTCGCTGACTCCCTTGAAG 



Dpnl 
Sau3AI | 
Taql | | 
Alwl | I I 

I II 1 



ScrFI 
BsaJI 
EcoRII 
NgoGV | 
NlalV j 
BanI | | 
MslI | | j 
I I II 



ScrFI 
AlwNI 
EcoRII 
Alul 
CviJI 
Fnu4HI 
Tsel 



Fnu4HI 
CviRI 
Tsel 
Cac8I 
Alul 
CviJI 
Hindlll | 
Dpnl | j 

Sau3AI | Ddel j | 
II III 



AACTCCCAACTCGATCCATTTAGGTGCCAGGGGCAAGATCACTAAGCTTGCAGCAGCTCC 

2521 + + + + + + 2580 

TTGAGGGTTGAGCTAGGTAAATCCACGGTCCCCGTTCTAGTGATTCGAACGTCGTCGAGG 



Acelll 
Bbvl | 
Bbvl | j 
I II 



Dpnl 
Sau3AI | 
Alwl | | 

I I I 



Alul 
CviJI 
Hin4I | 
Bed | | 
I I I 



Mnll 
SfaNI 
Hpyl78III 
BslI | 
CviRI | | 
Mnll | | 

I I I 



TGGTCATACGATTTATTTTTATGATCCTATTACGATGGAAGCTCCTGCATCTGGAGGAAC 

2581 + + ' + + + + 2640 

ACCAGTATGCTAAATAAAAATACTAGGATAATGCTACCTTCGAGGACGTAGACCTCCTTG 



BseRI Xcml 
Alul | Mnll | 

Bpml BseRI Cvi JI j Mnll | | 

II II III 

AATAGAGGAGTTAGTCATCAATCCTGTTGTCAAAGCTATTGTTCCTCCTCCCCAACCAAA 



2641 



-+ 2700 



TTATCTCCTCAATCAGTAGTTAGGACAACAGTTTCGATAACAAGGAGGAGGGGTTGGTTT 




Figure 8L 

Avail 
Sau96I 
BslI | 
PflMI j 



I I 
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BsmI 
Bce83I | 
MboII j 



Hpyl78III 
Smll | 
Cvi JI | j 
II I 



Apol 
Tsp509I 
I 



AAATGGTCCTATATAGAAGAAAAACGAATGCTCTTTGTAAGGCTCAAGAGTAAAAAATTC 

2701 + + + + + + 2 

TTTACCAGGATATATCTTCTTTTTGCTTACGAGAAACATTCCGAGTTCTCATTTTTTAAG 



ECOS7I 

Hpyl8 8IX Apol | 

Bcefl | Fnu4HI EcoRI | 
Bbvl | | Tsel| Tsp509I j 

III II II 

TAAAGGTATTCTCTCAATAGGTTCTGAAGTGCTGCCGTAGAATTCATAAATATCTC 

2761 + + + + + 2816 

ATTTCCATAAGAGAGTTATCCAAGACTTCACGACGGCATCTTAAGTATTTATAGAG 
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gure 9A 

Restriction enzyme analysis of CPN100624 (RY 64 - SEQ ID NO. 9) 

Msel 
Nlalll | 
Afllll | | Dral 
BspLUllI j I Swal 
Sspl |Nspl| Msel | 

I I II II 

TCAAATATATGAGTTTACTAACTCTGTAATATTCAACATGTTAATAAGCATATTTAAATA 

AGTTTATATACTCAAATGATTGAGACATTATAAGTTGTACAATTATTCGTATAAATTTAT 

Hpyl78III 
Apol Bfal| 
Tsp509I Psil Xbal | j Tsp509I 

I I III I 
TAAATTTATAAACTTCTAGACAACAAATTGATGATTTTTTATGACAAACTCTATTTTCAT 
61 + + + + + + 12 

ATTTAAATATTTGAAGATCTGTTGTTTAACTACTAAAAAATACTGTTTGAGATAAAAGTA 

Hhal 
TspRI 

Fokl BsmAI BtsI | 

SimI | DrdI Ddel | BseMII | | 

I I I I I III 

ATCAAAGTTTGGATGTTTATGCGACCCATTTGTCTCAGCATTTTATCCCACTGCGCTATG 

121 + + + + + + 18 

TAGTTTCAAACCTACAAATACGCTGGGTAAACAGAGTCGTAAAATAGGGTGACGCGATAC 

Hpyl78III 
Mnll | 

Hpyl78III Hpyl88IX | Bf al | 

BsmFI | Mnll | jxbal | | 

II I I I III 

TTGTTCCTTATCAGGAAATGAAGTCCCTAACCTCGCCTCTTGTCAGATGTCTAGAAAAGA 

181 + + + + + + 24 

AACAAGGAATAGTCCTTTACTTCAGGGATTGGAGCGGAGAACAGTCTACAGATCTTTTCT 
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Figure 



J 

BsmFI 
Banll 
BsaJI 
Bspl286I 
Styl 
CviJI 
Hpyl78III 
Maelll 
Hpyl88IX 
Bpml 
Alul 
CviJI 
Hindlll 
Btrl | 
Maell| BsmAlj 
Af 1III | | BsmBI | 
I II II 

CATCTCTGCTTTCCACACGTCTCCAAGCTTCCGTCTGAATGTAACTCCAGAGCCCTTGGT 

241 + + + + + + 

GTAGAGACGAAAGGTGTGCAGAGGTTCGAAGGCAGACTTACATTGAGGTCTCGGGAACCA 



300 



MboII 
Mnll | 



Hpyl78III 
Maelll 
Msel Tsp45I 
Taqll| Hinfl | 
Mnll | | Tf il | 

III I I 



ScrFI 
BsaJI | 
EcoRII | 

I I 



TTCCTCCTTTCGTCCCTCTAATCTTCTTAATGGATTCGGTCACGATATAACCCAGGACAT 

301 + + + + + + 

AAGGAGGAAAGCAGGGAGATTAGAAGAATTACCTAAGCCAGTGCTATATTGGGTCCTGTA 



360 



Mnll 
Xcml 
BslI | 
Pf 111081 | | 

Tsp509I Tsp509I Psil Mnll | j j Bed 

I I I I III I 

CACAATTACAGGAAACTCTATCAATTCTGTTATAGATTATAACTACCACTACGAGGATGG 

361 + + + + + + 

GTGTTAATGTCCTTTGAGATAGTTAAGACAATATCTAATATTGATGGTGATGCTCCTACC 



420 



Apol 
Tsp509I 

Nlalll | Msel 
CviRI | j Aflll| 
BsmI Fokl |NspI j Hpyl88IX Smll | 

I I I I I I II 

AGGCATTCTTGCATGTAAAAATTTGTTCATTTCTGAAAATAAAGGAAACTTAAGTTTTGA 

421 + + + + + + 480 

TCCGTAAGAACGTACATTTTTAAACAAGTAAAGACTTTTATTTCCTTTGAATTCAAAACT 



c. 

3 
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Figure 9C 



481 



BplI 
Hpyl78III 
Taal 
Sthl32I 
Sfcl | 
Banll 

Hpyl78III Bspl286I 
BstXI | CviJI | 
Alul Mnll | | RleAI | | | | TspRI 
CviJI Taal | | Hin4l| j || | Bpml | | | BsmI Ddel 

I III II I II I II I I I I 
AAGGAATAGCTCCCACAGTTCTGGAGGGGCTCTCTACAGTGTTCGGGAATGCTGGATTTC 
+ + + + + + 

TTCCTTATCGAGGGTGTCAAGACCTCCCCGAGAGATGTCACAAGCCCTTACGACCTAAAG 



540 



Hpyl88IX 
Hinfl | 
Tf il j 

I I 



Alul 
CviJI 
Eco57I 
Mwol 
BpulOI 
CviJI | 
Fnu4HI | | 
Taul | | 
Acil| j Ddel 
II I I 



TAAGAATCAGAACTACTCGTTTATTTCAAATGCGGCTTCCTTAGCTACTACTACAACTTC 

541 + + + + + + 600 

ATTCTTAGTCTTGATGAGCAAATAAAGTTTACGCCGAAGGAATCGATGATGATGTTGAAG 



Bf al 
CviRI 
Nlalll 

Nspl j Alul 

Hpyl78III CviJI Mwol | j CviJI Ddel 

I I I I I I I 

AGGATTTGGTGGGGCTATACATGCACTAGATAGCTATATTACAAATAACTTAGGAGAAGG 

601 + + + + + + 660 

TCCTAAACCACCCCGATATGTACGTGATCTATCGATATAATGTTTATTGAATCCTCTTCC 



Mnll Alul 
Mnll | CviJI BseRI 

BsmAI | j Hin4l| BseRI | 

II I II I Mil 

ACAATTCTTAGATAATGTCTCTAAAAATAGAGGAGGAGCTATCTATGTTGGGGTGAGTTT 

+ + + + + + 720 

TGTTAAGAATCTATTACAGAGATTTTTATCTCCTCCTCGATAGATACAACCCCACTCAAA 



Tsp509I Ddel Hin4I 



661 
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Figure 9D 



Avail 
EcoO109I 
Psp5II 

Sau96I Tthlllll 
Sse8647I Hinfl | 

HphI Ddel | Hpyl78III Tfil j 

I I i I II 

ATCAATCACAGACAACTTAGGTCCTATCGTTATCAAGAAAAATCAAACATTAGAAGATTC 

721 + + + + + + 780 

TAGTTAGTGTCTGTTGAATCCAGGATAGCAATAGTTCTTTTTAGTTTGTAATCTTCTAAG 



CviJI 
PstI 
BseRI 

MboII CviRI 
Mnll SfaNI 
Alul | Bcefl Sfcl | 

CviJI | Hin4I |BstAPl| j | | Tsp509I 
Mnll | | MboII | | Mwolj | j j Fokl | 

II I I I I II I I I II 

C AGCTTTGGAGG AGGCATC TTC TGCAGAGC CGTAAATATAGAAAGGAATTATCAAAAC AT 

781 + + + + + + 840 

GTCGAAACCTCCTCCGTAGAAGACGTCTCGGCATTTATATCTTTCCTTAATAGTTTTGTA 



Hin4I 
MboII 
Hinfl 
Bfal 
Avrll 
BsaJI 
Cjel 
Styl 
CjePI | 

Eco57I CjePI | Bsp24l|| 

III III 
CCAAATCAATGATAATGCTTCAGGACAAGGGGTGGTATATTTTCTGCCCCTAGGAGTCAT 

841 + + + + + + 

GGTTTAGTTACTATTACGAAGTCCTGTTCCCCACCATATAAAAGACGGGGATCCTCAGTA 



Hpyl7 8III 
Bsp24I | 
Cjel | 
CjePI | 



900 



HaelV 
Hin4I 

Fokl | Msel 
Dpnl | | SfaNI | 

Plel Earl Tsp509I Sau3AI | j j Acil Tsp509I | j Mill I 

I I I I I II I I II I 

TATCTCTTCAAATAAAGAAATTATAGAGATCAGCAATCACTCCGCATCCTCAATTAACAC 

901 + + + + + + 960 

ATAGAGAAGTTTATTTCTTTAATATCTCTAGTCGTTAGTGAGGCGTAGGAGTTAATTGTG 



c 
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SfaNI 
Hpyl78III | 
I I 



Acil 
Sthl32I 
Mspl | 
Neil | 
ScrFI Bsll| 
I I I 



Rsal 
Nlalll | 
I I 



Nlalll 
Hpyl78III | 
Mill I | | 
Ddel Real | j 
I III 



AGCATCAGGAAAACTATATCCCGGTGGTGGCGGTATCATGTGTACCTCCCTTAGTCATGA 

961 + + + + + + 1020 

TCGTAGTCCTTTTGATATAGGGCCACCACCGCCATAGTACACATGGAGGGAATCAGTACT 



BstZ17I 
Ecil 
Acil 
Beef I 
Bbvl | 
Fnu4HI | 
Taul I 
Fnu4HI Acil | 

Msel Tsel| Ddel I I I I I AccI 

I II I II 

GAACAATCCCAAAGGTCTTATCTTTAACAATAAAACGGCAGCACTTAGCGGCGGAGTATA 

1021 + + + + + + 1080 

CTTGTTAGGGTTTCCAGAATAGAAATTGTTATTTTGCCGTCGTGAATCGCCGCCTCATAT 



Haeiv 
Hin4I 
Dpnl 
MboII 
Bglll 
BstYI 
Sau3AI 
BSSSI | 

I I 



Acil 
Avail 
RsrII 
Sau96I 
Taal 



ECOS7I 
Msel 
Vspl 
I 



CACACGAGATCTTTCATCTTCCAAAATAACGGTCCGCACAGCATTTATTAATAACTCTGC 

1081 + + + + + + 1140 

GTGTGCTCTAGAAAGTAGAAGGTTTTATTGCCAGGCGTGTCGTAAATAATTATTGAGACG 



Banll 

Bspl286I Rsal Apol 

Hpyl78III CviJI | BseRI Seal Tsp509I 

Mnll | Hin4l| j BplI | TatI | MboII | Mnll 

II III I I I I III 

GACTTCAGGAGGGGCTCTCATCAATCTTTCTGGTATAGGAAGTACTCCTCAAAATTTCTT 

1141 + + + + + + 

CTGAAGTCCTCCCCGAGAGTAGTTAGAAAGACCATATCCTTCATGAGGAGTTTTAAAGAA 



1200 



Mnll 
PstI | 
CviRI | j 
Sf cl | | | 
I I II 



Beef I 
Msel I 



BseRI 
MboII | 
MboII | j 
I I I 



CCTCTCTGCAGACTACGGCGATATTCTATTTAACAATAATACAATCACATCTTCTTCTCC 

1201 + + + + + + 1260 

GGAGAGACGTCTGATGCCGCTATAAGATAAATTGTTATTATGTTAGTGTAGAAGAAGAGG 



C 

a- 



Figure 9F 
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Sthl32I Fnu4HI 

Mnll | Tsel| 

Mspl | | Sthl32I | | Neil 

Neil | | BstAPl| || ScrFI 

ScrFI | | CviRI || ||BsaJl| 

Bsll| | |BbvI | Mwolj || Mspl j 

II I I I I II II II 



Msel Msel 



% 



03 

m 

o 

m 



z < 
~ a 



Bfal 
I 



TCAACCCGGATATAGAAATGCACTCTATGCTGCTCCGGGGATTAACTTAAAACTAGGAGC 

1261 + + + + + + 

AGTTGGGCCTATATCTTTACGTGAGATACGACGAGGCCCCTAATTGAATTTTGATCCTCG 



1320 



Hpyl88IX 
Dpnl 
Sau3AI 
BsaBI 
Hpyl78III 
Dpnl 
Sau3AI 



Sf el 

Apol Dpnl | 

Tsp509I Sau3AI | | 

Psil | Alwl | | | 

II I III 

AAGACAGGGTTATAAAATTCTCTTTTATGATCCTATAGATCACGATCAGACGACAACAGA 

1321 + + + + + + 

TTCTGTCCCAATATTTTAAGAGAAAATACTAGGATATCTAGTGCTAGTCTGCTGTTGTCT 



Dpnl 
BstYI | 
Sau3AI 
HaelV | 
Hin4I j 
Alwl | j 
I I I 



1380 



Bsbl 
Taal 
NgoGV 



NlalV 
BanI 
BstXI | 

Tsp509I BsaJl| | 

Sfcl Msel | HphI Bccl Stylj j 

I II I I II I 

TCCTATAGTATTTAATTATGAACCCCATCACCTTGGCACCGTGTTGTTTTCCGGAATCAA 

1381 + + + + + + 1440 

AGGATATCATAAATTAATACTTGGGGTAGTGGAACCGTGGCACAACAAAAGGCCTTAGTT 



Hinf I 
Tf il 
Hpyl78III 
Mspl | 
BsaWI | j 
BspEI | | 
III 



CjePI 
MboII | 

Hinfl Apol | | Earl 

Tfil CjePI Tsp509I | | Hpyl78III 

II III I 

TGTAGATTCTAACGCAACAAATCCATTGAACTTCCTATCAAAATTTTCTAACTCTTCACG 

1441 + + + + + + 1500 

ACATCTAAGATTGCGTTGTTTAGGTAACTTGAAGGATAGTTTTAAAAGATTGAGAAGTGC 
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Figure 9G 



Dpnl 
Sau3AI 
Sthl32I 
Bbvl 



BsiHKAI 
Bspl286I 
Cac8I 



I l! I I II I I 
ACTTGAAAGGGGTGTGCTCGCTATTGAAGATCGGGCTGCTATTTCTTGCAAAACCCTATC 
1 + + + + + + l 

TGAACTTTCCCCACACGAGCGATAACTTCTAGCCCGACGATAAAGAACGTTTTGGGATAG 



MboII 
Fnu4HI | 
Cvi JI | j 
Tselj | 



CviRI 



150 



560 



Mwol 
Neil 
ScrFI 
Smal 
Mspl 
Neil 
ScrFI 
Aval 



BsaJI 
CviJI 
Haelll 
Sau96I 
Sthl32I 
Bbvl 



BsmI 
BsrI | 
Bmrl I 



Mae 1 1 



Hpyl78III 
Fnu4HI Msel | 
Tsel| Vspl | 

III I II I I 

GCAAACTGGGGGCATTCTACGTTTAGGAAACGCAGCATTAATCAGGACGAAAGGCCCGGG 

1561 + + + + + + 1620 

CGTTTGACCCCCGTAAGATGCAAATCCTTTGCGTCGTAATTAGTCCTGCTTTCCGGGCCC 



Dpnl 
Sau3AI 
Hpyl78III 

Alul MboII 
CviJI Apol CviRI Nrul 

Sthl32I |Tsp509I Msel | Thai 

II I I I I 



CviJI 
Hpyl88IX | 
I I 



AAGCTCCATAAATTTTAATGCAATCGCGATCAATCTTCCTTCTATTTTACAATCAGAAGC 

1621 + + + + + + 1680 

TTCGAGGTATTTAAAATTACGTTAGCGCTAGTTAGAAGGAAGATAAAATGTTAGTCTTCG 
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Figure 9H 



Dpnl 
NgoGV 
NlalV 

Alul Hpyl7 8III BamHI 
CviJI Acelll | BstYI 
BbvCI | BseMII | Sau3AI 
BpulOI | BstXI | Alwl | 

Ddel j Mnll | | Msel | | 

I I I I I II I I III I 

CTCAGCTCCAAAGTTCTGGATTTATCCTACATTAACAGGATCCACCTATTCTGAAGACAC 

GAGTCGAGGTTTCAAGACCTAAATAGGATGTAATTGTCCTAGGTGGATAAGACTTCTGTG 



MboII 
Hpyl88IX | 
I I 
I I 



Alwl 



Bbsl 



1681 



1740 



MboII 
I 



Eco57I 



NgoGV 
NlalV 
Avail 
Eco0109I 
PspSII 
Sau96I 
SimI 
Hpyl78III | 
Ddel | | 
I I I 



BseMII 



TTCTTCTACTATCACTCTCTCAGGACCCTTGACTTTTCTAAACGATGAAAATGAAAACCC 

1741 + + + + + + 1800 

AAGAAGATGATAGTGAGAGAGTCCTGGGAACTGAAAAGATTTGCTACTTTTACTTTTGGG 



Hpyl8 8IX 
Dpnl 
Bglll 



BstYI 
Sau3AI 
Ddel | 
Alul | | 
CviJI | j 
I I I 



Taql 



EcoRV 
BseRI | 
Mnll j 



Bcgl 
BseRI | 
I I 



Bcgl 
Taql 
Mnll | 
Mnll | j 
I I I 



CTATGATAGCTTAGATCTCTCTGAACCTCGAAAGGATATCCCCCCTCCTCTACCTCCTCG 

1801 + + + + + + 1860 

GATACTATCGAATCTAGAGAGACTTGGAGCTTTCCTATAGGGGGGAGGAGATGGAGGAGC 



CviRI 
Mnll 
Mnll | 
Maelll] | 
Tsp45l| j 
II I 



Hinf I 
Bcgl Tfil 
Clal |NspV | 
Taql j Taql |BcgI 
I I I I I 



Ddel 
Nlalll | 
CviJI | j 
I I I 



ATGTGACTGCAAAAAAATCGATACTTCGAATCTCATTGTAGAAGCCATGAACTTAGATGA 

1861 + + + + + + 1920 

TACACTGACGTTTTTTTAGCTATGAAGCTTAGAGTAACATCTTCGGTACTTGAATCTACT 
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Figure 91 

BsiHKAI 
Bspl286I 
I 



EcoRV 
I 



Hinf I 
Tf il 



Bsal 
BsmAI 



Fokl 
Pflll08I | 



Bed 



I I 



Alul 
CviJI 



GCACTATGGATATCAGGGAATCTGGTCTCCCTATTGGATGGAAACTACGACTACAACAAG 

1921 + + + + + + 1980 

, CGTGATACCTATAGTCCCTTAGACCAGAGGGATAACCTACCTTTGATGCTGATGTTGTTC 

Sfcl 
BciVI 
BsrI 
Hinf I 
BspGI 
Acelll 
Bbvl 



Mspl 
BsaWI 
Rsal | 
Taal I 



Sfcl 



Alul 
CviJI 
Fnu4HI | 
Tsel| | 
Cjel | | 
I II 



Plel 
AccI 
BsaAI | 
SnaBI | 
|MaeIl| | 
I II I 



CTCTACAGTACCGGAACAGACCAATACAAACCACAGGCAGCTCTACGTAGACTGGACTCC 

1981 + + + + + + 2040 

GAGATGTCATGGCCTTGTCTGGTTATGTTTGGTGTCCGTCGAGATGCATCTGACCTGAGG 

Maelll 
Sthl32I 
Tsp45I 

Mspl 
Neil 
ScrFI 

BslI |MaeII 
II I 



Acil 
Cjel | 
II 



ApOl 
Tsp509I 



TGTAGGATACCGCCCTAACCCGGAACGTCACGGAGAATTTATTGCTAATACCTTATGGCA 

2041 + + + + + + 2100 

ACATCCTATGGCGGGATTGGGCCTTGCAGTGCCTCTTAAATAACGATTATGGAATACCGT 



Acil 
Hinfl | 

Mwol Tfil jcjePI SfaNI Mnll Mnll 

I I I I I I I 

GTCTGCCTATAACGCTCTGTTAGGAATCCGCATCTTACCTCCACAAAACCTCAAAGAGCA 

2101 + + + + + + 2160 

CAGACGGATATTGCGAGACAATCCTTAGGCGTAGAATGGAGGTGTTTTGGAGTTTCTCGT 
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igure 9J 



Aval 
Hinfl 
Mnll | 

CviRI | | j Msel Hpyl78III 

Sthl32I j | j Tsp509I | Nrul 

Plel | j j |CviJI | | Mnll Thai Mwol 

II I I I I I I II I 

TGACCTTGAAGCCTCTCTGCAAGGACTCGGGCTTCTAATTAACCAACATAATCGCGAGGG 

2161 + + + + + + 2220 

ACTGGAACTTCGGAGAGACGTTCCTGAGCCCGAAGATTAATTGGTTGTATTAGCGCTCCC 



CjePI 
Nlalll |CviJI 
I I I 



Sthl32I 
Hpyl88IX 
BsmFI | 
CviJI | | 
Hgal | | 

I II 



Fnu4HI Bbvl 

CviJI | BbvCI | 

BscGI | CviRI j BpulOlj 

BslI | | Tselj Ddelj 

I I I II II II 

ACGCAAAGGCTTCCGAAACCATACTACGGGCTATGCAGCAACAACCTCAGCAAAAACTGC 

2221 + + + + + + 2280 

TGCGTTTCCGAAGGCTTTGGTATGATGCCCGATACGTCGTTGTTGGAGTCGTTTTTGACG 



BseMII 
Fnu4HI 
CviRI 
Tsel 
Sfcl 
BstAPI | 
Mwol j 
Mnll | 



Bfal Maell 



Hinfl 

PstI Bbvl Tfil 

III II 
AGCACGACATAGTTTCTCTTTAGGATTCGCACAAATGTTCTCCAAAACTAGAGAACGTCA 

2281 + + + + + + 2340 

TCGTGCTGTATCAAAGAGAAATCCTAAGCGTGTTTACAAGAGGTTTTGATCTCTTGCAGT 



Rsal 



Hinfl 
RleAI | 
CviRI | | 
Mnll Plel | 
I I I 



Taal 
MboII 
Taql 
BseRI | 
ECOS7I | | 
||AciI Ml 
II I III 



BsmAI 
I 



ATCTCCAAGTACGACTTCCTCCCACAACTACTTTGCAGGACTCCGCTTCGACAGTCTCCT 

2341 + + + + + + 2400 

TAGAGGTTCATGCTGAAGGAGGGTGTTGATGAAACGTCCTGAGGCGAAGCTGTCAGAGGA 



Dpnl 

Bfal Sau3AI 
Avrll| HphI | 

BsmFI BsaJI | Alul | | 

Sfcl | CviJI Stylj CviJI | | | Ndel 

III II III 

CTTCAGGGACTTCATCTCTACAGGGCTATCCCTAGGTTATAGCTACGGAGATCACCATAT 

2401 + + + + + + 2460 

GAAGTCCCTGAAGTAGAGATGTCCCGATAGGGATCCAATATCGATGCCTCTAGTGGTATA 



Hin4I 
Mnll | 
Earl | j 
Bsll| | | 
III I 
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Figure 9K 



Msel SimI CviJI Msel 

II II 
GCTTTGCCACTATACAGAAATCTTAAAAGGGTCGTCCAAAGCCTTCTTTAATAACCACAC 

2461 + + + + + + 2520 

CGAAACGGTGATATGTCTTTAGAATTTTCCCAGCAGGTTTCGGAAGAAATTATTGGTGTG 



Hpyl7 8III 
CviJI 



Bsgl 
BslI | 
PflMI j 

I I 



2521 



Hinf I 
Tf il 
Bfal 

CjePI Alul | 

CviRI | CviJI | 

Mnll | | HphI | | 

III III 
TTTGGTAGCCTCTCTAGACTGCACATTCTTACCAGCTAGAATCACCCGCACTCTCGAACT 

+ + + + + + 2580 

AAACCATCGGAGAGATCTGACGTGTAAGAATGGTCGATCTTAGTGGGCGTGAGAGCTTGA 



Bfal 
Xbal| 
II 



CjePI 
Hpyl78III 
TaqI 
Faul 
Sthl32I | 
Acil | | 

Bpml | | | 

I I II 



CviJI 
Hael 
Haelll 

StuI BstXI 
ScrFI | Bsal | 

TspRI Hhal BsaJI | | BsmAI | 

CviJI BsrDl| Cjel |EcoRII | | Mnll | | 

I II I I I I I III 

CCAGCCCTTTATCAGTGCCATTGCTCTGCGCTGTTCCCAGGCCTCGTTCCAAGAAACTGG 

2581 + + + + + + 2640 

GGTCGGGAAATAGTCACGGTAACGAGACGCGACAAGGGTCCGGAGCAAGGTTCTTTGACC 



Bed 
Bpml 
Fokl | 

Cjel Apol | | 

BsrI | Fokl Tsp509I j j 
II I II I 



Hin4I 
Dpnl 
Bglll | 
BstYI j 
Sau3AI j 
I I 



CviJI 
Mnll | 
I I 



AGACCATATAAGAAAATTCCATCCAAAACATCCCCTTACAGATCTTTCCTCTCCCATAGG 

2641 + + + + + + 2700 

TCTGGTATATTCTTTTAAGGTAGGTTTTGTAGGGGAATGTCTAGAAAGGAGAGGGTATCC 



BslI 
Pf 1MI 

Hpyl88IX Nlalll | 

I I I 

CTTCCGTTCTGAATGGAAAACTTCACATCATATCCCCATGCTATGGACTACGGAAATATC 

2701 + + + + + + 2760 

GAAGGCAAGACTTACCTTTTGAAGTGTAGTATAGGGGTACGATACCTGATGCCTTTATAG 




Figure 



9L 

Rsal 
BsaAI | 
SnaBI | 
Maell| | 
II I 
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2761 



Hpyl78III 

CjePI | Hpyl78III CjePI 

III I 
CTACGTACCTACCCTATACAGAAAAAATCCAGAAATGTTCACGACACTACTCATCAGCAA 

+ + + + + + 2820 

GATGCATGGATGGGATATGTCTTTTTTAGGTCTTTACAAGTGCTGTGATGAGTAGTCGTT 



Tsp509I 

Bbvl | 

BsmAI I CviRI 

BsmBI | Fnu4HI 

Sthl32l| j Alul | 

Nlalll Tthlllllj j CviJI | 

BsrDI | Bsbl BscGI || | Tsel | 

III I II I II 

TGGAACATGGACAACACAAGCAACTCCCGTCTCCTATAATTCCGTAGCTGCAAAAATAAA 

2821 + + + + + + 2880 

ACCTTGTACCTGTTGTGTTCGTTGAGGGCAGAGGATATTAAGGCATCGACGTTTTTATTT 



Bce83I 
CjePI | 
I I 



Maelll 
Hpyl78III | 
Smll | j 
I I I 



Ddel 
Bce83I | 
CjePI | | 
I I I 



Smll 
Alul | 
CviJI | 
BseRI | j 

III 



Acelll 



AAATACTTCCCAACTTTTCTCAAGAGTAACCTTATCCTTAGATTATTCAGCTCAAGTCTC 

2881 + + + + + + 2940 

TTTATGAAGGGTTGAAAAGAGTTCTCATTGGAATAGGAATCTAATAAGTCGAGTTCAGAG 



Mnll 
Taal 
Sfcl | 
Hindi | | 
BsmAI | | j 
I III 



2941 



BsrDI 
Hinfl 
Ddel | 
Msel Alul | | | CviRI 
BseMII | CviJI | j | Plel Msel 
II II I 

CTCGTCAACTGTAGGTCAATACCTTAAAGCTGAGAGTCATTGCACATTTTAACCACAAAG 
+ + + + + + 3000 

GAGCAGTTGACATCCAGTTATGGAATTTCGACTCTCAGTAACGTGTAAAATTGGTGTTTC 



Dpnl 
BstYI 
Sau3AI 
Alwl 



TspRI 
CviRI | 
Taal | j 
I I I 



Mmel 
MboII | 
Ddel | | 
III 



AAAACATCAAGGAATAAACAGTGCAAAATAACAGATCCCTTAGTAAATCTTCCTTCTTTG 

3001 + + + + + + 3060 

TTTTGTAGTTCCTTATTTGTCACGTTTTATTGTCTAGGGAATCATTTAGAAGGAAGAAAC 
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Figure 9M 

Tsp509I 
Msel | 
CviJI | j 
NgoGV| | j 
Nlaivj j j 
II M 

TTGGAGCCTTAATTTTAGGTAAAACTACAATA 

3061 + + + -- 3092 

AACCTCGGAATTAAAATCCATTTTGATGTTAT 




JO n 
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r igure 10A 

Restriction enzyme analysis of CPN100633 (RY 65 - SEQ ID NO, 



10) 



Msel 
Vspl 
Tsp509I | 
Msel | | 
Taal | | | 

II II 

AAACAGTTAAATAATTAATAGACAATAATCTATTCTTATTGACTTCTTTTTTTCTTGTTT 
1 + + + + + + go 

TTTGTCAATTTATTAATTATCTGTTATTAGATAAGAATAACTGAAGAAAAAAAGAACAAA 

Apol 
Tsp509I 

Msel NspV | 

Msel Mnll | Nlalll TaqI | 

I II III 

ATTAAAGTTGCTTCAACCTTATTGATTTAACGAGGAAACCATGACCATACTTCGAAATTT 

61 + + + + + + 120 

TAATTTCAACGAAGTTGGAATAACTAAATTGCTCCTTTGGTACTGGTATGAAGCTTTAAA 



Fnu4HI 
Tsel | 
PstI | 
Fnu4HI | 
CviRI | 
Tsel | 
Sfcl | | 

BspMI Mnll I I I 

Cvi JI Mwol | | | 

I II II 

TCTTACCTGCTCGGCTTTATTCCTCGCTCTCCCTGCAGCAGCACAAGTTGTATATCTTCA 

121 + + + + + + 180 

AGAATGGACGAGCCGAAATAAGGAGCGAGAGGGACGTCGTCGTGTTCAACATATAGAAGT 



Bbvl 

Bbvl | Hpyl78III 
MboII | Real | 
II II 



Ddel 
Alul | 

MslI CviJlj 
Nlalll] Bed Psil Taal Hindlll || Tsp509I 

II I I I I II I 

TGAAAGTGATGGTTATAACGGTGCTATCAATAATAAAAGCTTAGAACCTAAAATTACCTG 

181 + + + + + + 240 

ACTTTCACTACCAATATTGCCACGATAGTTATTATTTTCGAATCTTGGATTTTAATGGAC 



Hpyl78III 



Btrl 
Mae 1 1 
. Mnll 
Hpyl78III | 
Bfal| | 
Xbal | | | 



Msel 
Acll | 
Maell 



TTATCCAGAAGGAACTTCTTACATCTTTCTAGATGACGTGAGGATTTCCAACGTTAAGCA 

241 + + + + + + 300 

AATAGGTCTTCCTTGAAGAATGTAGAAAGATCTACTGCACTCCTAAAGGTTGCAATTCGT 
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Figure 10B 



8 



r - 



CD 

CD 



Hpyl78III 
Dpnl | 

Nlalll | I Dpnl 
Bell | | I Sau3AI | 

Sau3Al|| j Clal| | Hinfl 

SfaNljj j Mmel MboII Psil TaqI j | Tfil Nlalll 

III I I I I II I I I 

TGATCAAGAAGATGCTGGGGTTTTTATAAATCGATCTGGGAATCTTTTTTTCATGGGCAA 

ACTAGTTCTTCTACGACCCCAAAAATATTTAGCTAGACCCTTAGAAAAAAAGTACCCGTT 



360 



CviRI 
BstAPI | 
Mwol | 
Taal | | 
I I I 



Bbvl 
BsaJI | 
Mnll | j 

I I I 



CjePI 
Fnu4HI 

Haell 

Hhal 
TaqI I 

Tsel 
Mmel | 

II 



NspV 
TaqI 



CCGTTGCAACTTCACTTTTCACAACCTTATGACCGAGGGTTTTGGCGCTGCCATTTCGAA 

361 + + + + + + 420 

GGCAACGTTGAAGTGAAAAGTGTTGGAATACTGGCTCCCAAAACCGCGACGGTAAAGCTT 



CjePI 
BsmAI | 
Thai | 
Acil | | 
I I I 



CjePI Tsp509I 



Ddel 
HphI 
CjePI | 
I I 



Bce83I 
BbvCI | 
BpulOI j 
Ddel | 



CCGCGTTGGAGACACCACTCTCACTCTCTCTAATTTTTCTTACTTAGCGTTCACCTCAGC 

421 + + + + + + 480 

GGCGCAACCTCTGTGGTGAGAGTGAGAGAGATTAAAAAGAATGAATCGCAAGTGGAGTCG 



Mnll 
Smll 
BseMII | 
Mnll I 



Mnll 



HaelV 
Hin4I 



NgoGV 
NlalV 
Drdll | 
II 



TaqI 
Dpnl | 
Sau3AI | | Mnll 
I II I 



ACCTCTACTACCTCAAGGACAAGGAGCGATTTATAGTCTTGGTTCCGTGATGATCGAAAA 

481 + + + + + + 540 

TGGAGATGATGGAGTTCCTGTTCCTCGCTAAATATCAGAACCAAGGCACTACTAGCTTTT 



Cjel 
CjePI | 

Earl | Fnu4HI 

Bsp24I | j Alul | 

MboII Bbvl | j | Cvi JI | Hin4I 

Cjel | Acelll| Ml Tsel | Cjel | 

I I II III II I I 

TAGTGAGGAAGTGACTTTCTGTGGGAACTACTCTTCGTGGAGTGGAGCTGCGATTTATAC 

541 + + + + + + 600 

ATCACTCCTTCACTGAAAGACACCCTTGATGAGAAGCACCTCACCTCGACGCTAAATATG 



Maelll 
RleAI 
Tsp45I 
Bsp24I | 
Cjel | 
CjePI | 

I I 



\ 

y 

Q 



Figure IOC 
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Acil 
MspAlI 
Ddel | 
Faul | j 



Ddel Sthl32l| j 

Eco57I Hinf I Plel | j | 

I I II 



EcoRII 
SexAI 
BseMII 
Acil 
Mwol | 



j NgoGV | j 
|NlaIv| | 
I III 



TCCCTACCTTTTAGGTTCTAAGGCGAGTCGTCCTTCAGTAAATCTCAGCGGGAACCGCTA 

601 + + + + + + 660 

AGGGATGGAAAATCCAAGATTCCGCTCAGCAGGAAGTCATTTAGAGTCGCCCTTGGCGAT 

Haell 
Hhal 
NgoGV 
NlalV 
BsaHI 
Narl 
BanI 
Fnu4HI 

BsmAI Taul 
ScrFI | Acil | 

BslI | j CviJI BstXI | | 

II I I I II 

CCTGGTGTTTAGAGACAATGTGAGCCAAGTTTATGGCGGCGCCATATCTACCCACAATCT 

661 + + + + + + 720 

GGACCACAAATCTCTGTTACACTCGGTTCAAATACCGCCGCGGTATAGATGGGTGTTAGA 



Avail 
ECOO109I 
PspSII 
Sau96I 
Sse8647I 
Taql 
Aval 



Smll 
Xhol 
Hinfl 
Hpyl78III 
Mnll 
RleAI 
Plel | 
I I 



Btrl 
Maell ] 
Nlalll | 
Hpyl78III | 
CjePI | | 
Nlalll Real j | 
I III 

CACACTCACGACTCGAGGACCTTCGTGTTTTGAAAATAATCATGCTTATCATGACGTGAA 

721 + + + + + + 780 

GTGTGAGTGCTGAGCTCCTGGAAGCACAAAACTTTTATTAGTACGAATAGTACTGCACTT 



M O 0 
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Figure 10D 



Mnll 



Alwl 
Dpnl 
Sau3AI 
Clal 



TaqI 
Dpnl 
Hiri4I 
Sau3AI 
ScrFI 
EcoRII 



BsrDI 
CviJI | 
NgoGV| j 
Nlaivj j 

III 



Mnll 
BseRI | 
CjePI | | 
BsrDI | j j 

II I I 



Bpml 



Acil 
I 



TAGTAATGGAGGAGCCATTGCCATTGCTCCTGGAGGATCGATCTCTATATCCGTGAAAAG 

781 + + + + + + 840 

ATCATTACCTCCTCGGTAACGGTAACGAGGACCTCCTAGCTAGAGATATAGGCACTTTTC 



Hin4I 

Dpnl 
Bglll 
BstYI 
MboII 

Sau3AI j j SfaNI Fokl CjePI 

III III 
CGGAGATCTCATCTTCAAAGGAAATACAGCATCACAAGACGGAAATACAATACACAACTC 

841 + + + + + + 900 

GCCTCTAGAGTAGAAGTTTCCTTTATGTCGTAGTGTTCTGCCTTTATGTTATGTGTTGAG 



CjePI 
Msel 
Taal 
BsiHKAI 
Bspl286I 
Hpyl78III | 
Bed Xcml | | 
Bed |CviRl| | | 

I I II I I 



Hpyl78III 
Mspl 
BsaWI 
BspEI 

BsaAI Hinfl | 

Maell| Tfil | 

Bpml | j Hpyl8 8IX | j 
III III 



CATCCATCTGCAATCTGGAGCACAGTTTAAGAACCTACGTGCTGTTTCAGAATCCGGAGT 

901 + + + + + + 960 

GTAGGTAGACGTTAGACCTCGTGTCAAATTCTTGGATGCACGACAAAGTCTTAGGCCTCA 



Bsp24I 
CjePI 
Cjel 
Dpnl | 
Sau3AI | | 
Alwl | | j 

I I II 



Tsp509I 
Plel | 



Dpnl 
Cjel 
CjePI 
Bglll | 
Bsp24I | 
BstYI | 
Sau3AI | 

I I 



961 



CviJI Hinfl 

I I I 
TTATTTCTATGATCCTATAAGCCATAGCGAGTCGCATAAAATTACAGATCTTGTAATCAA 
+ + + + + + 1020 

AATAAAGATACTAGGATATTCGGTATCGCTCAGCGTATTTTAATGTCTAGAACATTAGTT 



55* 
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Figure 10E 

Hpyl78III 
Ddel 

Alul 
Bsp24I 

Cjel | | BseMII 
Tsp509I CjePI | j ScrFl| 
Hpyl78III Eco57I | CviJI j | EcoRII | | 

I I I I I I III 

TGCTCCTGAAGGAAAGGAAACTTATGAAGGAACAATTAGCTTCTCAGGACTATGCCTGGA 

1021 + + + + + + 1080 

ACGAGGACTTCCTTTCCTTTGAATACTTCCTTGTTAATCGAAGAGTCCTGATACGGACCT 



Cjel 
CjePI 
Bsp24I 
Fokl 
Nlalll 
Hpyl78III 
Real 



Dpnl | 
Bell | j 
Sau3AI j | 
I II 



Maelll 
Tsp45I 



lAcil Tsp45I Mnll 

I I I 

TGATCATGAAGTTTGTGCGGAAAATCTTACTTCCACAATCCTACAAGATGTCACATTAGC 

1081 + + + + + + 1140 

ACTAGTACTTCAAACACGCCTTTTAGAATGAAGGTGTTAGGATGTTCTACAGTGTAATCG 



BstEII 
Maelll 
BplI | 

Ppil Bed | | 

Hin4I | Hpyl88IX | j | 
II I II I 



CviRI BsmI 
Fokl I CviRI 



Msel 



AGGAGGAACTCTCTCTCTATCGGATGGGGTTACCTTGCAACTGCATTCTTTTAAGCAGGA 

1141 + + + + + + 1200 

TCCTCCTTGAGAGAGAGATAGCCTACCCCAATGGAACGTTGACGTAAGAAAATTCGTCCT 



Bpml 
Alul | 
CviJI | 
Cac8I | j 
I I I 



Drdll 
NgoGV 
NlalV 
BsmAI 
ScrFI 
EcoRII | 
BsaXI | | 

I I I 



Sthl32I 
Hpyl78III 
BpulOI | 
Ddel | 
SfaNI I 



BseMII 
Aval | 
Bsgl| | 
I I I 



AGCAAGCTCTACGCTTACTATGTCTCCAGGAACCACTCTGCTCTGCTCAGGAGATGCTCG 

1201 + + + + + + 1260 

TCGTTCGAGATGCGAATGATACAGAGGTCCTTGGTGAGACGAGACGAGTCCTCTACGAGC 



C 



Figure 10F 
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Hpyl78III 

CviRI | 

AlwNI | | 

Hinfl | | | Thai 

Tfil | | j Mnll | 

Hpyl88IX III I Hinfl | j 

Fokl | | I | I MboII Tfil | j 

I I I I I I I III 

GGTTCAGAATCTGCACATCCTGATTGAAGATACCGACAACTTTGTTCCTGTAAGGATTCG 

1261 + + + + + + 

CCAAGTCTTAGACGTGTAGGACTAACTTCTATGGCTGTTGAAACAAGGACATTCCTAAGC 



1320 



SfaNI 
BsaJI | 
Hhal | | 
III 



BsmAI 
Fokl | 



Mnll 
CjePI | 
Msel | | 
I I I 



CviJI 
Bgli | 

Mwol j 
I I 



CGCCGAGGACAAGGATGCTCTTGTCTCATTAGAAAAACTTAAAGTTGCCTTTGAGGCTTA 

1321 + + + + + + 1380 

GCGGCTCCTGTTCCTACGAGAACAGAGTAATCTTTTTGAATTTCAACGGAAACTCCGAAT 



Hinfl Mnll 

Avail Tsp509I Mnll Tfil Earl | 

Sau96I CjePI | Msel | CviJI MboII | Hpyl78III j 

I I I I I I I I II 

TTGGTCCGTCTATGACTTTCCTCAATTTAAGGAAGCCTTTACGATTCCTCTTCTTGAACT 

1381 + + + + + + 1440 

AACCAGGCAGATACTGAAAGGAGTTAAATTCCTTCGGAAATGCTAAGGAGAAGAACTTGA 



CviJI 
Haelll 
ECOO109I | 
Sau96I | 
Bfal | 



Bbsl 
MboII 



Taal 



Bfal 
Bsal 
BsmAI 
Avrll | 
BsaJI j 
Styl| 
I I 



Maelll 
Tsp45I 



TCTAGGGCCTTCTTTTGACAGTCTTCTCCTAGGGGAGACCACTTTGGAGAGAACCCAAGT 

1441 + + + + + + 1500 

AGATCCCGGAAGAAAACTGTCAGAAGAGGATCCCCTCTGGTGAAACCTCTCTTGGGTTCA 



Beef I 
I 



Hgal 
Taql 
BsmFI | 
Mnll | | 
BsaHI | j j 

I I I I 



BslI 
Alul 
BslI 
CviJI 
BpulOI 
Ddel 
NgoGV 
NlalV 
Avail | 
Sau96I I | Earl 
I 



Rsal MboII 
I I 



CACAACAGAGAATGACGCCGTTCGAGGTTTCTGGTCCCTAAGCTGGGAAGAGTACCCCCC 

1501 + + + + + + 1560 

GTGTTGTCTCTTACTGCGGCAAGCTCCAAAGACCAGGGATTCGACCCTTCTCATGGGGGG 



0 
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Mnll 
Hinf I | 

Hpyl78III Dpnl Ddel Tf il j 

BslI | Sau3AI | Alwl | Taal BseMII | j 

II I I I I I III 

TTCTCTGGATAAAGACAGAAGGATCACACCAACTAAGAAAACTGTTTTCCTCACTTGGAA 

1561 + + + + + + 1620 

AAGAGACCTATTTCTGTCTTCCTAGTGTGGTTGATTCTTTTGACAAAAGGAGTGAACCTT 



Dpnl 
Sau3AI | 

Ddel | | Msel Hinfl 

Hpyl78III | j Ddel AccI Tsp509I | Tfil 

III II III 

TCCTGAGATCACTTCTACGCCATAATCTCTAAGTCTACACTATAATTAAGGGAATCCCCT 

1621 + + + + + + 1680 

AGGACTCTAGTGAAGATGCGGTATTAGAGATTCAGATGTGATATTAATTCCCTTAGGGGA 



MboII NgoGV 

NgoGVj NlalV 

Nlaivj Avail 

Avail | | ECOO109I 

EcoO109l|| Hpyl88IX | 

PspSIljj BsmFI | PspSII 
Msel Sau96l|| BsmFI | | Sau96I 

I III I I I I 

TTAAGAAGATTTTGGGACCTATCTGTATTCAGAGATAGGTCCCTCTATGCACACATGTTC 

1681 + + + + + ' + 1740 

AATTCTTCTAAAACCCTGGATAGACATAAGTCTCTATCCAGGGAGATACGTGTGTACAAG 



BssSI 
Nlalll 
Afllll | 
BspLUllI | 
Mnll | j 
CviRI | |NspI 
III I 



Hpyl78III 
I 

ACGAG 

1741 1745 

TGCTC 
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Restriction enzyme analysis of CPN100985 (RY 66 



0 



SEQ ID NO. 11) 



Hpyl78III 
Dpnl 
Mnll 

Sau3AI | | Bpull02I 

Msel | j j Ddel BspMI 

NlalV III | CviJI | Bpml | CviRI 

I I I I I II II I 

TTTGGAACCTTAATGATCTCTGGAGGGTGGCTTAGCAATATGATTTTACGCTTTGCAGGT 

AAACCTTGGAATTACTAGAGACCTCCCACCGAATCGTTATACTAAAATGCGAAACGTCCA 



Alul 
CviJI 



Hinf I 
Tf il 



Cjel 



Hpyl88IX 

I III 
CAGATTTTCCAAAACTTCTATAAATGGAAATAAAGAGCTTATGGGAATCTCTCTACCAGA 

61 + + + + + + 120 

GTCTAAAAGGTTTTGAAGATATTTACCTTTATTTCTCGAATACCCTTAGAGAGATGGTCT 



Bfal CviJI 

Avrll| Cjel Haelll 

Alul BsaJI j Fokl | Mspl | 

CviJI Stylj Ddel Mmel j Tthlllll | |MnlI 

I II III I I I I 

GCTTTTTTCCAACCTAGGTTCTGCTTACTTAGATTATATCTTTCAACATCCTCCGGCCTA 

121 + + + + + + 180 

CGAAAAAAGGTTGGATCCAAGACGAATGAATCTAATATAGAAAGTTGTAGGAGGCCGGAT 



MboII 
BslI | 
I I 



Sthl32I 
BscGI | 
CviJI | | 
I I I 



Alul 
CviJI 
Sf cl | 
I I 



TGTTTGGTCAGTTTTTCTTCTTTTATTAGCCCGTCTGCTTCCTATTTTTGCTGTAGCTCC 

181 + + + + + + 240 

ACAAACCAGTCAAAAAGAAGAAAATAATCGGGCAGACGAAGGATAAAAACGACATCGAGG 



BslI 
Ddel | 
II 



Alul 
CviJI 
Hin4I | 
I I 



Hpyl78III 
Mnll 
Msel | 
Bpll| | 
Sthl32I | | | 

I I I I 



Cac8I 
CviJI | 



BsmAI 



I I 
I I 



CTTCTTAGGAGCAAAGCTCTTTCCCTCCCCTATTAAAATCGGGATTAGTCTCTCTTGGCT 

241 + + + + + + 

GAAGAATCCTCGTTTCGAGAAAGGGAGGGGATAATTTTAGCCCTAATCAGAGAGAACCGA 



300 



Tsp509I 
Dpnl | 
Sau3AI | 



BsaBI 



I 



Ecil 
Acil | 

CviRI BciVI | j Sau3AI | | Nlalll 

I III III I 

TGCAATCATCTTTCCAAAAGTCTTGGCGGATACGCAGATCACAAATTACATGGATAACAA 

ACGTTAGTAGAAAGGTTTTCAGAACCGCCTATGCGTCTAGTGTTTAATGTACCTATTGTT 



360 



c 

ure 11B 
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J 



Dpnl 
Bell | 

Sau3AI j CviJI 

I I I 
TCTCTTTTATGTTTTACTTGTGAAGGAGATGATCATAGGCATTGTGATAGGCTTTGTTTT 

361 + + + + + + 420 

AGAGAAAATACAAAATGAACACTTCCTCTACTAGTATCCGTAACACTATCCGAAACAAAA 



Alwl 
HaelV 
Hin4I 
Dpnl 

CviRI BstYI | 

Bbvl Fnu4H-I | Sau3AI | | | Hinfl 

Bsgl| Tselj j Mwol I I I I Tfil 

II II I I I I I I I 

AGCATTTCCCTTTTATGCTGCACAATCGGCAGGATCTTTCATCACTAACCAACAAGGGAT 

421 + + + + + + 480 

TCGTAAAGGGAAAATACGACGTGTTAGCCGTCCTAGAAAGTAGTGATTGGTTGTTCCCTA 



Fokl Hhal Nlalll 

Mnll | Thai Hin4I Acil Mnll | 

III I I II 

TCAGGGTTTAGAGGGCGCGACATCCCTGATTTCCATTGAGCAGACCTCTCCGCATGGCAT 

481 + + + + + + 540 

AGTCCCAAATCTCCCGCGCTGTAGGGACTAAAGGTAACTCGTCTGGAGAGGCGTACCGTA 



BstEII 

Hpyl78III Mae I I I 

MaeIIl| Tsp45I 
BplI Tsp45l| Taqll HphI | Taal 

I II I I I I 

TTTATACCATTACTTCGTGACTATTATTTTTTGGTTAGTGGGTGGTCACCGTATTGTAAT 

AAATATGGTAATGAAGCACTGATAATAAAAAACCAATCACCCACCAGTGGCATAACATTA 



600 



Dpnl 
Sau3AI 
Hpyl88IX| 
Alwl | | 
XmnI | j | 
II II 

CTCTTTGTTATTGCAAACTCTTGAAGTCATTCCGATCCATAGTTTCTTTCCTGCCGAGAT 

601 + + + + + + 660 

GAGAAACAATAACGTTTGAGAACTTCAGTAAGGCTAGGTATCAAAGAAAGGACGGCTCTA 



Hpyl7 8III 
CviRI | 
I I 
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Figure 11C 



Msel 
Af III | 

Smll |Bspl286I 
Alul | | Bmgl |Sthl32I 
CviJI | | BseSI | BslI | 

III II II 



661 



Hpyl78III Acelll 

Dpnl | Alul BsmAI 

Bell | j CviJI Hpyl78III 

Sau3AI j | Cac8I | BssSI | 

III II II 
GATGAGCTTAAGTGCCCCGATTTGGATTACTATGATCAAGATGTGCCAGCTCTGTCTCGT 
+ + + + + + 

CTACTCGAATTCACGGGGCTAAACCTAATGATACTAGTTCTACACGGTCGAGACAGAGCA 



720 



Mwol 
Alul 
CviJI 
PstI 
Fnu4HI 
CviRI 



Ddel 
Alul | 
CviJI | 
MspAlI j 



Mwol 
Tsel 
Sfcl | 
BsiHKAI | | 



BseMII 



PvuII |Bspl286I 
II 



I I 



I I I 



Hpyl8 8IX 
Msel | 
Bbvl | j 
I I I 



Ddel 



GATGACCATACAGCTGAGTGCTCCTGCAGCTTTGGCGATGTTAATGTCCGACCTATTCTT 

721 + + + + + + 

CTACTGGTATGTCGACTCACGAGGACGTCGAAACCGCTACAATTACAGGCTGGATAAGAA 



780 



Bgll 
Mwol 
Msel 
Af III 
Mnll | 

BseRI Smll I 

Mnll | Mnll | 

II II 
AGGGATTATTAACCGTATGGCACCTCAAGTTCAGGTCATCTACCTCCTCTCTGCCCTTAA 

781 + + + + + + 840 

TCCCTAATAATTGGCATACCGTGGAGTTCAAGTCCAGTAGATGGAGGAGAGACGGGAATT 



Taal 

Mmel | Smll 

Bce83I | | NlalV | 

Msel j | BanI | | 

III II I 



SimI 
Nlalll | 

Bbsl | | ScrFI 
MboII | | BsaJI | 

CviJI | | | HphI EcoRII |BslI 

I I II I III 

GGCTTTCATGGGTCTTCTCTTTCTCACCCTGGCGTGGTGGTTCATAATTAAGCAGATAGA 

841 + + + + + + 900 

CCGAAAGTACCCAGAAGAGAAAGAGTGGGACCGCACCACCAAGTATTAATTCGTCTATCT 



Msel 
Tsp509I | 
Drdll | j 
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Figure 11D 



Bfal 
Avrll | 
BsaJI j 

Drdll Styl j 

Tthlllll BsmFI | Bce83I | |NlaIV Smll 

I II III I I 

TTATTTCACTCTTGCTTGGTTCAAAGAAGTCCCCATTATGCTCCTAGGTTCCAACCCTCA 

901 + + + + + + 960 

AATAAAGTGAGAACGAACCAAGTTTCTTCAGGGGTAATACGAGGATCCAAGGTTGGGAGT 



Mnll 
Rsal | 
Seal j 
Tat I | | 
I I I 



CviJI 
Bfal 
Mmel 
Avrll | 
BsaJI j 
Styl | 
II 



Hpyl78III 
SfaNI 
Hinfl 
Hpyl78III 



Maelll Ml | Bpml 

Tsp45l| j | j Hhal Hinfl | 

Plel III | | Hin4I | Tf il | 

I II I I I II II 

AGTACTCTAATCCCCTAGGCTCTTATCGTGACTCTTATCTGGAGATGCGCTCACTTACGA 

961 + + + + + + 

TCATGAGATTAGGGGATCCGAGAATAGCACTGAGAATAGACCTCTACGCGAGTGAATGCT 



1020 



BplI TspRI 
Ddel | Taal | 
Cjel | j Hhal | | 

I II I II 



Cjel 
Hinfl | 
Ddel Tfil | 
I I I 



Ddel 
I 



ATCTTAGCGCACTGTTTATGGATTATCTTAGGGAATCTCTCGCATATTCTTTTGTAATCT 

1021 + + + + + + 1080 

TAGAATCGCGTGACAAATACCTAATAGAATCCCTTAGAGAGCGTATAAGAAAACATTAGA 



Hpyl78III 
Hinfl Apol | 

Tfil Tsp509I | 

I I I 

AAGAAT C TATAAATTCAAGA 

1081 + + 1100 

TTCTTAGATATTTAAGTTCT 



v. 



if 
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Figure 12A 

Restriction enzyme analysis of CPN100987 (RY 67 



SEQ ID NO. 12) 



Hinf I 

Hin4I |SfaNI BsrDI 

TspRI | |Bfal| Tsp509l| 

Bsrl Plel | j |Bpml| Hpyl78III Mnll || 

I I I I I II I III 
CCAGTGATAAAGACTCTAGTGATAAAGATGCTCCAGAAGGAAGCAATGAAATTGAGGGTG 
1 + + + + + + go 

GGTCACTATTTCTGAGATCACTATTTCTACGAGGTCTTCCTTCGTTACTTTAACTCCCAC 



Hpyl78III 



Maelll 
Tsp45I 
Ddel I 



Hpyl78III 
Bf al | 
Xbal I 



I 
I 

MslI 



Bpml 
BsaJI | 
Stylj 



61 



Bsbl 

ii i Mi i ii 

CTTAGTGACTGCCAACACTTTTGGAACTCTAGACATCTTGATGAAGCACTCCAAGGAAGA 
GAATCACTGACGGTTGTGAAAACCTTGAGATCTGTAGAACTACTTCGTGAGGTTCCTTCT 



120 



ScrFI 

EcoRII | Hinfl 

HgiEII j CjePI | 

Hin4I | j BseRI | | 

CjePI |MboII | Mnll MboII Fokl|TfiI 

I I I I I I III 

TGACCTCTCCAGGTTTCTTCCTAAAAATCTTCTTGTTGAATCTCCTCATCCCGAAGAAAT 

121 + + + + + + 180 

ACTGGAGAGGTCCAAAGAAGGATTTTTAGAAGAACAACTTAGAGGAGTAGGGCTTCTTTA 



Sthl32I 
Mnll | 
Hpyl78III | j 
I I I 



Dral 

MboII | CviJI 
Mselj Fokl| Tsp509I Nlalll 

II II I I 

CCCTTTAAAATCTTTATCTTTTACGATGAGTTGGCTACCTACAATTCATCCTTCATGGAT 

181 + + + + + + 240 

GGGAAATTTTAGAAATAGAAAATGCTACTCAACCGATGGATGTTAAGTAGGAAGTACCTA 

BsaJI 

Nlalll Styl 
BsrDI MslI |XmnI Hpyl78III Mnll|Tsp509I 

I I I I I II I 

TACCATTGCCATGAAAGAGTTCCCTCCTGAAATCCAAGGTCAATTATTAGCGTGGTTGCC 

241 + + + + + + 300 

ATGGTAACGGTACTTTCTCAAGGGAGGACTTTAGGTTCCAGTTAATAATCGCACCAACGG 



Apol 
Tsp509I 
Hpyl78III | 



CviJI 

ScrFI SfaNI | 

CviJI Hpyl78III | EcoRII | Sfcl | | MslI 

I II I I III I 

AGAGCCTTTAGTTCAAGAAATTCTACCCTTACTGCCTGGCATCTCTATAGCCCCACATCG 

301 + + r + + + + 360 

TCTCGGAAATCAAGTTCTTTAAGATGGGAATGACGGACCGTAGAGATATCGGGGTGTAGC 
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CviJT 
MboII 
NlalV 
Hpyl88IX 
RleAI 
BsiHKAI 



Bspl286I 

BseSI || IN Hpyl78III 

CviRI || III Bf al | 

ApaLI Ml | || Xbal | | 

I I I I I II III 
CTGTGCACCTTTCGGAGCCTTCTATCTTCTAGATATGCTAAGTAAAAAGATCCGTCCTTG 

361 + + + + + + 

GACACGTGGAAAGCCTCGGAAGATAGAAGATCTATACGATTCATTTTTCTAGGCAGGAAC 



Dpnl 
BstYI | 
Sau3AI | 
Ddel Alwl | | 
I I I 



I 



420 



Sf aNI 
Mwol 

XmnI BbvCI | 

Fokl | BpulOlj | BseMII 

Tsp509I MboII | | MboII CviRI Ddel | | Mnll | 

I I I I I I II I I I 

TGGAATTACAGAAGAAATCTTTCTTCCTGCATCCTCAGCAAATGCTATACTTTACTATAC 

421 + + + + + + 480 

ACCTTAATGTCTTCTTTAGAAAGAAGGACGTAGGAGTCGTTTACGATATGAAATGATATG 



AlwNI 
Avail 
ECO0109I 
PspSII 
Sau96I 
Sse8647I 



Dpnl 
Sau3AI | 
I I 



Bf al 
Avrll | 
BsaJI | 
Styl| 



Msel 

I I I 

AGGTCCTGTAAAGATCGCTTTAATCAACTGCCTAGGTCTTTATTCTATTGCTAAAGAGTT 

481 + + + + + + 540 

TCCAGGACATTTCTAGCGAAATTAGTTGACGGATCCAGAAATAAGATAACGATTTCTCAA 



MboII 

Hpyl78III Bsml | Sfcl 

I III 
GAAGCACATTCTGGATAAGGTTGTGATTGAACGAGTGAAGAATGCTCTCTCCCCTACAGA 

541 + + + + + + 600 

CTTCGTGTAAGACCTATTCCAACACTAACTTGCTCACTTCTTACGAGAGAGGGGATGTCT 



MboII 
Apol | 
Tsp509I | 

Fokl Hpyl88IX Pflll08I | | 

I I III 

GAAACTCTTTCTTACCTACTGCCAATCTCATCCGATGAAACATTTAGAAACTACGAATTT 

601 + + + + + + 660 

CTTTGAGAAAGAATGGATGACGGTTAGAGTAGGCTACTTTGTAAATCTTTGATGCTTAAA 
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Tsp509I 

SfaNI CviRI | Taal 

I II I 

TCTTTCTTCTTGGACTACTGATGCAGAATTACGACAGTTCGTTCATAAGCAAGGGTTAGA 

661 + + + + + + 720 

AGAAAGAAGAACCTGATGACTACGTCTTAATGCTGTCAAGCAAGTATTCGTTCCCAATCT 

Taqll 
BsaAI | 
SnaBI j 

Msel Maell| | 

I III 
GTTTTTAGGTAAAGCATTAACAAAAGAAAACGCTTCTTTTCTATGGTATTTTCTACGTAG 

721 + + + + + + 780 

CAAAAATCCATTTCGTAATTGTTTTCTTTTGCGAAGAAAAGATACCATAAAAGATGCATC 



Fokl 

Dral | MslI 
TaqI Msel | |NlaIII Bed | 

II I II I I II 

GTTAGATGTCGGTCGAGCATATATCGTCGAGCAGACTTTAAAAACATGGTATGACCATCC 



BsiEI 
TaqI Hin4I 



781 



- + + + + + 840 



CAATCTACAGCCAGCTCGTATATAGCAGCTCGTCTGAAATTTTTGTACCATACTGGTAGG 

Faul 

Sthl32l| Nlalll 
Bfal | | Nsil | 

BsmFI Msel Acil | j | CviRI | | Ddel Hindlll 

I I I I II I I I I I 

CTATGTGGATTATTTTAAGTCCCGCCTAGAACAATGCATGAAAGTCTTAGTGAAATAAAA 

841 + + + + + + 900 

GATACACCTAATAAAATTCAGGGCGGATCTTGTTACGTACTTTCAGAATCACTTTATTTT 



Alul Alul 
CviJI CviJI 
I I 

GCTTTATAAGTAAAGATTTAGCTTTATACAAAGTATAGAAAAATAACACG 

901 + + + + + 950 

CGAAATATTCATTTCTAAATCGAAATATGTTTCATATCTTTTTATTGTGC 



% 
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Figure 13A 

Restriction enzyme analysis of CPN100988 (ry68 



SEQ ID NO. 13) 



Maelll 

Dral | Dpnl AccI 

Maelll Msel | | Sau3AI | Nlalll | Bed Fokl 

I III II II I I 

CGATTTCGTTACCTTTAAAGTTACTTTTGATCGTCATGGTAGACGGATGGACATTACTGC 

GCTAAAGCAATGGAAATTTCAATGAAAACTAGCAGTACCATCTGCCTACCTGTAATGACG 



60 



Dral 
Msel 
Alul 
CviJI 
Dpnl 
Bell 



Sau3AI 
CviJI 
Mwol | 
BsaJI | I 
Styl | | 
I I I 



BsaAI 
Pmll 

Maell| Mwol Bfal 

AflHI || NlaIIl| Mwol Spel| 

III II I II 

TCCAAGGGCTTATGATCAGCTTTAAATAAGGACACGTGCCATGTTAGCATTTTTCGCAAC 

AGGTTCCCGAATACTAGTCGAAATTTATTCCTGTGCACGGTACAATCGTAAAAAGCGTTG 



120 



Rsal 
Seal 
Tat I | 
I I 

TAGTTTCAAATCTGTTCTTTTTGAGTACTCCTACCAATCATTATTACTTATTTTGATTGT 

121 + + + + + + 180 

ATCAAAGTTTAGACAAGAAAAACTCATGAGGATGGTTAGTAATAATGAATAAAACTAACA 



Sthl32I 

Alul 
CviJI 

NlalV Ddel | 

Banl | BccI Mnll| j 

II I II I 



Hpyl78III 
BslI | 
I I 



Dpnl 
Sau3AI | 
MboII | | 

I I I 



Acil 
Fnu4HI 

Taul 
CviJI | 
I I 



TTCGGCACCTCCCATCATCTTAGCTTCCATAGTCGGGATTATGGTTGCGATCTTCCAAGC 
AAGCCGTGGAGGGTAGTAGAATCGAAGGTATCAGCCCTAATACCAACGCTAGAAGGTTCG 



240 



Hpyl78III 
Bfal | 

Bsbl CviRI Spel| j 

I I II I 

CGCAACACAAATCCAAGAACAGACCTTCGCTTTTGCAGTCAAACTAGTCGTGATTTTTGG 

241 + + + + + + 300 

GCGTTGTGTTTAGGTTCTTGTCTGGAAGCGAAAACGTCAGTTTGATCAGCACTAAAAACC 



JCl 
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Figure 13B 



Hpyl78III 
Dpnl | 
Mnll j 
Sau3AI | j Bpull02I 

Msel | | j Ddel BspMI 

NlalV III | CviJl| Bpml | 

I I I I I II II 

AACCTTAATGATCTCTGGAGGGTGGCTTAGCAATATGATTTTACGCTTTGCAGGTCAGAT 

301 + + + + + + 360 

TTGGAATTACTAGAGACCTCCCACCGAATCGTTATACTAAAATGCGAAACGTCCAGTCTA 



Hpyl8 8IX 
CviRI | 

I I 



361 



Alul 

Alul Hinfl CviJI 
CviJI Tfil Cjel| 

II II 
TTTCCAAAACTTCTATAAATGGAAATAAAGAGCTTATGGGAATCTCTCTACCAGAGCTTT 

+ h + + + + 

AAAGGTTTTGAAGATATTTACCTTTATTTCTCGAATACCCTTAGAGAGATGGTCTCGAAA 



420 



Bf al 
Avrll | 
BsaJI | 
Styl| 
I I 



Ddel 



Cjel 
Fokl | 
Mmel | 

I I 



CviJI 
Haelll 
Mspl | BslI 
Tthlllll | |MnlI | 

Ml II 



TTTCCAACCTAGGTTCTGCTTACTTAGATTATATCTTTCAACATCCTCCGGCCTATGTTT 

421 + + + + + + 480 

AAAGGTTGGATCCAAGACGAATGAATCTAATATAGAAAGTTGTAGGAGGCCGGATACAAA 

MboII 



GGTCAGTTTTTCTTCTTTTA 

481 + + 500 

CCAGTCAAAAAGAAGAAAAT 
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Figure 14: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 14; ORF : cpnl00686 

1 MVSSPILNVP LKNHASVSGK FTHREVSKLA SDLKSGAMSF VPEVLSEETI 

51 SSDLGKKQCT QGIISACCGL AMLIVLMSVY YRFGGVIASG AVLLNLLLIW 

101 AALQYLDAPL TLSGLAGIVL AMGMAVDANV LVFERIREEF LLSQSLKKSV 

151 EKGYTKAFGA IFDSNLTTVL ASALLFFLDT GP I KGFALTL ILGIFSSMFT 

2 01 ALFMTKFFFM LWMNKTQHTQ LHMMNKF VG I KHDFLRGCKK LWAVSGSVFL 

251 LGCVALGFGA WNSVLGMDFK GGYAFTFNPK EHGISDVAQM RGKWHKLQE 

301 AGLSSRDFRI QTFGSSEKIK IYFSDKALSY TKQIRASLLK LTIMSWRYCG 

351 IWRNRPRFL YGNS KRNAKF WSKVSSKLSK KMRYQATIGL LGALAI ILLY 

4 01 VSLRFEWQYA FSAVCALIHD LLATCAVLF I AHFFLKKIQI DLQAIGALMT 

451 VLGYSLNNTL IIFDRIREDR QANLFTPMHV LVNDALQKTF SRTVMTTATT 

501 LSVLLMLLFI GGSSVFNFAF IMTIGILLGT LSSLYIAPPL LLFMVRKENR 
551 SK 




Possible T cell epitope: 
42 7 VLFIAHFFL 



Possible B cell epitope: 
4 65 RIREDRQAN 
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Figure 15: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 15; ORF : cpnl00696 

1 MS SNLHPVGG TGTGAAAPES VLNIVEEIAA SGSVTAGLQA ITSSPGMVNL 

51 LIGWAKTKFI QPIRESKLFQ SRACQITLLV LGILLWAGL ACMFIFHSQL 

101 GANAFWL IIP AAIGLIKLLV TSLCFDEACT SEKLMVFQKW AGVLEDQLDD 

151 GILNNSNKIF GHVKTEGNTS RATTPVLNDG RGTPVLSPLV SKIARV 



Possible T cell epitope: 
133 KLMVFQKWA 



Possible B cell epitope: 
163: VKTEGNTSRAT 
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Figure 16: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 16; ORF : cpnl00709 

1 MTIRILAEGL AFRYGSKGPN IIHDVSFSVY DGDFIGIIGP NGGGKS TLTM 

51 LILGLLTPTF GSLKTFPSHS AGKQTHSMIG WVPQHFSYDP CFPISVKDW 

101 LSGRLSQLSW HGKYKKKDFE AVDHALiDLVG LSDTTTTAFA HLSGGQIQRV 

151 LLARALASYP EILILDEPTT NIDPDNQQRI LSILKKLNRT CT I LMVTHDL 

2 01 HHTTNYFNKV F YMNKTLH F I GRHFDLNRP I LLSSYKNQEF SCSPH 



Possible T cell epitope: 
212 YMNKTLHFI 



Possible B cell epitopes: 
109 SWHGKYKKKDFE 
166 DEPTTNIDPDNQQR 
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Figure 17: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 17; ORF : cpnl00710 

1 MHKVIVFIFL TLYSLKSYGN DVIDKPHVLV SIAPYKFLVE QIAEETCFVY 

51 AIVTNHYDPH TYELPPQQIK ELRQGDLWFR IGEAFGKNLL EKPYMQQVDL 

101 SQNVSLIQGK PCCNQHTTNY DTHTWLS PKN LKVQVETIVT TLSKKYPQHA 

151 TLYQSNGEKL LLALDQLNEE ILTITSKAKQ RHILVSHGAF GYFCRDYNFS 

2 01 QHTIEKSSHV EPSPKDVARV FRDIEQYKIS SVILLEYSGR RSSAMLADRF 

251 HMHTVNLDPY AENVLVNLKT IATTFSSL 




Possible T cell epitope: 
12 5 WLSPKNLKV 



Possible B cell epitope: 

55 NHYDPHTYELPPQQIKELRQGD 
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Figure 18: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 18; ORF : cpnl00711 

1 MGPGSVLiSNH S KEAGG IAIN NVIIDFSEIV PTKDNATVAP PTLKLVSRTN 

51 ADSKDKIDIT GTVTLLDPNG NLYQNSYLGE DRDITLFNID NSASGAVTAT 

101 NVTLQGNLGA KKGYLGTWNL DPNSSGSKII LKWTFDKYLR WPYIPRDNHF 

151 YINSIWGAQN SLVTVNQGIL GNMLNNARFE DPAFNNFWAS AIGSFLRKEV 

2 01 SRNSDSFTYH GRGYTAAVDA KPRQEFILGA AFSQVFGHAE SEYHLDNYKH 
251 KGSGHSTQAS LYAGNIFYFP AIRSRPILFQ GVATYGYMQH DTTTYYPSIE 

3 01 EKNMANWDSI AWLFDLRFSV DLKEPQPHST ARLTFYTEAE YTRIRQEKFT 
351 ELDYDPRSFS ACSYGNLAIP TGFSVDGALA WREIILYNKV SAAYLPVILR 

4 01 NNPKATYEVL STKEKGNWN VLPTRNAARA EVSSQIYLGS YWTLYGTYTI 
451 DASMNTLVQM ANGGIRFVF 



Possible T cell epitope: 
312 WLFDLRFSV 



Possible B cell epitope: 
24 0 : ESEYHLDNYKHKGSGHST 
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Identification of T- and B-cell epitopes from the amino acid 



Figure 

sequences SEQ ID No. 19; ORF : cpnl00877 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 



MRFSLCGFPL 
QAGDVYSLTG 
GTTKEGAVLC 
NNYWRFEQN 
PLQIAVNQAE 
AIGKGGAVCC 
SISSGGPTLF 
SLPFLNGIHL 
NKEYTGTILF 
FTQSPGSHLV 
NKQISVTDSI 
FLPVSPHYGF 
WGNAVDVRSL 
SGGYVLSVNN 
TTSLGNI FRY 
YANFPMVKNS 
HGDFKETTAD 
IFRKDPSCEA 
GSIECRPHAR 



VFSFTLLSVF 
DVSISNVDNS 
CQDPQATARF 
QSKTKGGAIS 
IRFAQNTAKN 
LPTSGSSTPV 
INNISYANSQ 
LQNAKFLKLQ 
SGEKSLANDP 
LDLGTKLIAS 
ELISPTGNAY 
QGNWKLAWTG 
MQVQETHASS 
EITPKHYTSM 
AS RN PNVNVG 
WRNNCWAIEC 
GRRFSNGSLT 
ALVISGDSWL 
NYNINCGSKF 



DTSLSATTIS 
ALNKACFNVT 
SGFSTLSFIQ 
GANVTIVGNY 
GSGGALYSDG 
PIVTFSDNKQ 
NLGGAIAIDT 
ARNGYSIEFY 
RDFKSTIPQN 
KEDIAITGLA 
EDLRMRNSQT 
TGNKVGEFFW 
LQTDRGLWID 
AFSQLFSRDK 
ILSRRFLQNP 
GGSMPLLVFE 
SISVPLGIRF 
VPAAHVSRHA 
RF 



LTPEDSFHGD 
SGSVTFAGNH 
SPGDIKEQGC 
DSVSFYQNAA 
DIDIDQNAYV 
LVFERNHSIM 
GGEISLSAEK 
DPITSEADGS 
VNLSAGYLVI 
IDIDSLSSSS 
FPLLSLEPGA 
DKINYKPRPE 
GIGNFFHVSA 
DYAVSNNEYR 
LMIFHFLCAY 
NGRLFQGAIP 
EKLALSQDVL 
FVGSGTGRYH 



SQNAERSYNV 
HGLYFNNISS 
LYSKNALMLL 
TFGGAIHSSG 
LFRENEALTT 
GGGAIYARKL 
GTITFQGNRT 
TQLNINGDPK 
KEGAEVTVSK 
T AAV I KANT A 
GGSVTVTAGD 
KEGNLVPNIL 
SEDNIRYRHN 
MYLGSYLYQY 
GHATNDMKTD 
FMKLQLVYAY 
YDFSFSYIPD 
FNDYTELLCR 



Possible T cell epitope: 
14 6 ALMLLNNYV 

Possible B cell epitope: 
581 DKINYKPRPEKEG 
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Figure 20: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 20; ORF : CPN100325 

1 MPSSWKRLLQ VLSHKIAATE SGGGIYAKDI QLQALPGSFT ITDNKVETSL 
51 TTSTNLYGGG IYSSGAVTLT NISGTFGITG NSVINTATSQ DADIQGGGIY 
101 ATTSLSINQC NTPILFSNNS AATKKTSTTK QIAGGAIFSA AVTIENNSQP 
151 IIFLNNSAKS EATTAATAGN KDSCGGAIAA NSVTLTNNPE ITFKGNYAET 

2 01 GGAIGCIDLT NGSPPRKVSI ADNGSVLFQD NSALNRGGAI YGETIDISRT 
251 GATFIGNSSK HDGSAICCST ALTLAPNSQL I FENNKVTET TATTKAS INN 

3 01 LGAAI YGNNE TSDVTISLSA ENGSIFFKNN LCTATNKYCS IAGNVKFTAI 
3 51 EASAGKAISF YDAVNVPPKK QLLKS 



Possible T cell epitope: 
22 6 VLFQDNSAL 



Possible B cell 
2 57 NSSKHDG 



epitope : 
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re 21: Identification of T- and B-cell epitopes from the amino acid 



sequences SEQ ID No. 21; ORF : CPN100368 

1 MKYSLPWLLT SSALVFSLHP LMAANTDLSS SDNYENGSSG SAAFTAKETS 

51 DASGTTYTLT SDVSITNVSA ITPADKSCFT NTGGALS FVG ADHSLVLQTI 

101 ALTHDGAAIN NTNTALSFSG FSSLLIDSAP ATGTSGGKGA ICVTNTEGGT 

151 ATFTDNASVT LQKNTSEKDG AAVSAYSIDL AKTTTAALLD QNTSTKNGGA 

2 01 LCSTANTTVQ GNSGTVTFSS NTATD KGGG I YSKEKDSTLD ANTGWT F KS 

251 NTAKTGGAWS SDDNLALTGN TQVLFQENKT TGSAAQANNP EGCGGAICCY 

301 LATATDKTGL AISQNQEMSF TSNTTTANGG AIYATKCTLD GNTTLTFDQN 

351 TATAGCGGA I YTETEDFSLK GSTGTVTFST NTAKTGGALY SKGNSSLTGN 

4 01 TNLLFSGNKA TGPSNSSANQ EGCGGAILAF IDSGSVSDKT GLSIANNQEV 

451 SLTSNAATVS GGAI YATKCT LTGNGSLTFD GNTAGTSGGA IYTETEDFTL 

501 TGSTGTVTFS TNTAKTGGAL YSKGNNSLSG NTNLLFSGNK ATGPSNSSAN 

551 QEGCGGAILS FLESASVSTK KGLiW I EDNEN VSLSGNTATV SGGAIYATKC 

601 ALHGNTTLTF DGNTAETAGG AIYTETEDFT LTGSTGTVTF S TNT AKT AG A 

651 LHTKGNTS FT KNKALVFSGN SATATATTTT DQEGCGGAIL CNISESDIAT 

701 KSLTLTENES LSFINNTAKR SGGGIYAPKC VISGSESINF DGNTAETSGG 

751 AIYSKNLSIT ANGPVS FTNN SGGKGGAIYI ADSGELSLEA IDGDITFSGN 

801 RATEGTSTPN SIHLGARGKI TKLAAAPGHT IYFYDPITME APASGGTIEE 

851 LVINPWKAI VPPPQPKNGP I 



Possible T cell epitope: 
7 WLLTSSALV 



Possible B cell epitopes: 



162 QKNTSEKDG 

5 3 8 GNKATGPSNSSANQEG 
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Figure 22: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 22; ORF : CPN100624 

1 MTNSIFISKF GCLCDPFVSA FYPTALCCSL SGNEVPNLAS CQMSRKDISA 

51 FHTSPSFRLN VTPEPLVSSF RPSNLLNGFG HDITQDITIT GNSINSVIDY 

101 NYHYEDGGIL ACKNLFISEN KGNLS FERNS SHSSGGALYS VRECWISKNQ 

151 NYSFISNAAS LATTTTSGFG GAIHALDSYI TNNLGEGQFL DNVS KNRGGA 

201 IYVGVSLSIT DNLGPIVIKK NQTLEDSSFG GGIFCRAVNI ERNYQNIQIN 

251 DNASGQGWY FLPLGVIISS NKEIIEISNH SASSINTASG KLYPGGGGIM 

301 CTSLSHENNP KGLIFNNKTA ALSGGVYTRD LSSSKITVRT AFINNSATSG 

351 GALINLSGIG STPQNFFLSA DYGDILFNNN TITSSSPQPG YRNALYAAPG 

401 INLKLGARQG YKILFYDPID HDQTTTDPIV FNYE PHHLGT VLFSGINVDS 

451 NATNPLNFLS KFSNSSRLER GVLAI EDRAA ISCKTLSQTG G I LRLGNAAL 

501 IRTKGPGSSI NFNAIAINLP SILQSEASAP KFWIYPTLTG STYSEDTSST 

551 ITLSGPLTFL NDENENPYDS LDLSEPRKDI PPPLPPRCDC KKIDTSNLIV 

601 EAMNLDEHYG YQGIWSPYWM ETTTTTSSTV PEQTNTNHRQ LYVDWTPVGY 

651 RPNPERHGEF IANTLWQSAY NALLGIRILP PQNLKEHDLE ASLQGLGLLI 

701 NQHNREGRKG FRNHTTGYAA TTSAKTAARH SFSLGFAQMF SKTRERQSPS 

751 TTSSHNYFAG LRFDSLLFRD FISTGLSLGY SYGDHHMLCH YTEILKGSSK 

8 01 AFFNNHTLVA SLDCTFLPAR ITRTLELQPF ISAIALRCSQ ASFQETGDHI 

851 RKFHPKHPLT DLSSPIGFRS EWKTSHHIPM LWTTEISYVP TLYRKNPEMF 

901 TTLLISNGTW TTQATPVSYN SVAAKIKNTS QLFSRVTLSL DYSAQVSSST 

951 VGQYLKAESH CTF 




Possible T cell epitope: 
64 0 QLYVDWTPV 



Possible B cell epitopes: 
701 NQHNREGRKG FRNHTTG 
741 SKTRERQSPSTTSSHNY 
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Figure 23: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 23; ORF : CPN100633 

1 MTILRNFLTC SALFLALPAA AQWYLHESD GYNGAINNKS LEPKITCYPE 

51 GTSYIFLDDV RISNVKHDQE DAGVF INRSG NLFFMGNRCN FTFHNLMTEG 

101 FGAAI SNRVG DTTLTLSNFS YLAFTSAPLL PQGQGAIYSL GSVMIENSEE 

151 VTFCGNYSSW SGAAIYTPYL LGSKASRPSV NLSGNRYLVF RDMVSQVYGG 

2 01 AISTHNLTLT TRGPSCFENN HAYHDVNSNG GAIAIAPGGS ISISVKSGDL 
251 IFKGNTASQD GNTIHNSIHL QSGAQFKNLR AVSESGVYFY DPISHSESHK 

3 01 ITDLVINAPE GKETYEGTIS FSGLCLDDHE VCAENLTSTI LQDVTLAGGT 
351 LSLSDGVTLQ LHSFKQEASS TLTMSPGTTL LCSGDARVQN LHILIEDTDN 

4 01 FVPVR I RAED KDALVSLEKL KVAFEAYWSV YDFPQFKEAF TIPLLELLGP 
451 SFDSLLLGET TLERTQVTTE NDAVRGFWSL SWEEYPPSLD KDRR I TPTKK 
501 TVFLTWNPEI TSTP 



Possible T cell epitope: 
64 0 QLYVDWTPV 



Possible B cell epitope: 
4 82 WEEYPPSLDKDRRITPTKK 
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J 



Figure 24: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 24; ORF: cpnl00985 

1 MGISLPELFS NLGSAYLDYI FQHPPAYVWS VFLLLLARLL PIFAVAPFLG 

51 AKLFPSPIKI GISLSWLAII FPKVLADTQI TNYMDNNLFY VLLVKEMIIG 

101 IVIGFVLAFP FYAAQSAGSF ITNQQGIQGL EGATSLISIE QTSPHGILYH 

151 YFVTIIFWLV GGHRIVISLL LQTLEVIPIH SFFPAEMMSL SAPIWITMIK 

2 01 MCQLCLVMTI QLSAPAALAM LMSDLFLGII NRMAPQVQVI YLLSALKAFM 

2 51 GLLFLTLAWW FIIKQIDYFT LAWFKEVPIM LLGSNPQVL 



Possible T cell epitope: 
8 3 YMDNNLFYV 



Possible B cell epitope: 
7 8 TQ I TNYMDNN 



U 
U 

C 
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Figure 25: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 25; ORF : cpnl00987 



1 MKHSKEDDLS 
51 MKEFPPEIQG 
101 SKKIRPCGIT 
151 LDKWIERVK 
2 01 VHKQGLEFLG 
251 YFKSRLEQCM 



RFLPKNLLVE 
QLLAWLPEPL 
EEIFLPASSA 
NALSPTEKLF 
KALTKENASF 
KVLVK 



SPHPEEIPLK 
VQEILPLLPG 
NAI LYYTGPV 
LTYCQSHPMK 
LWYFLRRLDV 



SLSFTMSWLP 
ISIAPHRCAP 
KIALINCLGL 
HLETTNFLSS 
GRAY I VE QTL 



TIHPSWITIA 
FGAFYLLDML 
YSIAKELKHI 
WTTDAELRQF 
KTWYDHPYVD 



Possible T cell epitope: 
22 0 FLWYFLRRL 



Possible B cell epitope: 
1 MKHSKEDDLSR 




c 



u 
u 
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Figure 26: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 26; ORF : cpnl00988 

1 MLAFFATSFK SVLFEYSYQS LLLILIVSAP PIILASIVGI MVAIFQAATQ 
51 IQEQTFAFAV KLWIFGTLM ISGGWLSNMI LRFAGQIFQN FYKWK 

Possible T cell epitope: 
21 LLLILIVSA 

Possible B cell epitope: 
89 QNFYKWK 



